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SOME VARIABLES AFFECTING PATTERN CONCEPTION 


EVA I. SHIPSTONE? 


Harvard University 


- oe approach to the study of concept 


formation is based on the assumption 
that there is an implicit conception of 
structure (patterning behavior) in adult 
human beings and that this can be classified 
into hierarchically organized categories. In 
order to give a more definite conception of 
what this means, we begin with an anno- 
tated protocol recorded for a single sub- 
ject. The procedure described in this 
sample protocol is slightly different from 
that used in the main’ experiments de- 
scribed below; it was chosen, however, in 
order to unravel for the reader in some 
detail the process of strategy construction 
in concept formation. This technique is 
replaced by a simpler and more concise one 
in the experiments to be described later. 


AN ILLUSTRATIVE STUDY 
(Experiment I) 


The subject sits at a table; across from 
him is the experimenter. The subject has 
just finished reading the following instruc- 
tions: 

During the last war an army base was re- 
ceiving important code information, one message 
at a time. This information had to be sorted 
for filing as it was received. Now suppose you 
were the officer on duty with this important job 
of receiving and filing this information. You 
will now be presented with the same informa- 
tion, single message on single card, two cards 
at first, then one card at a time. You are to 
study the card and then decide how you are to 
file it. You can put the cards into as many piles 
or boxes as you wish and you are free to move 


1 The author, now teaching at Isabella Thoburn 
College, Lucknow, India, wishes to acknowledge 
her indebtedness to George A. Miller for directing 
this research. Without his invaluable help this 
thesis could not have been written. Gratitude is 
also expressed to E. B. Newman, Chairman of 
the department, for his suggestions. 


around a card from one box to another any 
time you wish. You will not be limited in time 
since the decisions you make are very important, 
but work rapidly. Read these instructions again 
until you understand them thoroughly and read 
them again in between if you should forget 
them. 

Any questions? 

Now, one further point. As you work with these 
cards, think aloud as if you were talking to your- 
self. Talk freely about what you are doing 
throughout your performance. 

Any questions? 

“Here,” says the experimenter, “are the first two 
cards.” The subject is handed two cards, 5” X 8” 
on which are typewritten: ZIR PAG ZIR ZIR 
PAG NEN; and ZIR PAG NEN PAG NEN 
PAG NEN PAG NEN PAG NEN PAG NEN 
PAG ZIR NEN. Only three nonsense words, 
NEN, PAG, ZIR, arranged in various different 
orders, are typed on any of the cards; it is con- 
venient to abbreviate them, N, P, and Z. The 
subject looks at the two cards, laughs self 
consciously, and places them separately on the 
table in front of him. 

He says, “It seems as though the main difference 
in the way things repeat is first according to the 
symbols that appear and then in the order in which 
they appear.” 

There is a silent pause. The experimenter says, 
“Now you will have to tell me when you are ready 
for the next card.” 

“Next card, please.” 

The next card contains the message ZPZN. The 
subject places it on the table by itself. All three 
cards are separately placed, so that the experi- 
menter asks, “Why did you place that card there?” 

“Oh, well,” the subject says, “there seems to be 
a difference in the lengths of the messages. And 
I don’t have enough information to really decide 
on categories. I am just setting them aside until 
| gather a little more information.” 

The experimenter, who has seen some people 
base their entire classification on the length of 
the messages, makes a special note that the first 
basis of classification mentioned by the subject is 
length. Will he stay with this aspect, or will he 
look further? 

The next card is ZPZZZZZZZZZZZZZPN. The 
subject says, “Now this card is more similar in 
length to one of the other ones I received, and 
it contains a large number of ZIRs as the other 
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one, so I will put these two together, possibly 
in the same category.” He places together 
ZPZZZZZZZZZZZZPN and ZPZZPN;; since he 


does not combine ZPZZZZZZZZZZZZZPN and 
ZPNPNPNPNPNPNPZN, he seems to have re- 
jected length as a basis of classification. For the 
moment, apparently, his attention is caught by 
another aspect, the repeating unit ZIR. 

The next card is ZPNPZN. 
it silently. There is a long pause. 
are thinking,” says the experimenter, 
it aloud. I know that is not easy.” 

“Well,” says the subject, “I just noticed that 
all of the five cards start with the same syllable 
and the syllables appear to be ordered. Similarly, 
the second syllables in all the cards are the same. 
A difference arises in the third syllable, so there 
is a possibility of arranging these cards according 
to the syllables. Alphabetically, in other words. 
But for the time being I will put this card in 
with the one which has similar length, that is, with 
the same number of syllables, like six or so.” 
He proceeds to file them by length, as follows: 
ZPNPZN and ZPZZPN are put together, ZPNPN- 
PNPNPNPNPZN and ZPZZZZZZZZZZZZZPN 
are put together, and ZPZN is ieft by itself. Al- 
though he is beginning to pay attention to the 
way the messages begin, and to consider the 
possibility of an alphabetical ordering, he rejects 
it in favor of a classification on the basis of 
length. He has destroyed his first tentative group 
ing on the basis of a repeating unit and has 
returned to the earlier hypothesis. 

The next card is ZPPN. “Again,” says the 
subject, “all the cards start with the same two 
syllables. But I am going to set this card aside 
for a separate category for no other reason than 
that I want to gather more cards before I make 
a classification.” There is a pause. “Am I to 
assume that this is coded information? I mean, 
information like language information ?” 

“T am sorry I cannot 
These are messages 


The subject reads 
“Whatever you 
“please say 


The experimenter replies, 
answer that question for you. 
you are receiving and you are trying to file them.” 

“Messages are coming in and I am trying to 
organize them. Hmm temporarily, I am 
going to rearrange the cards and put them in 
alphabetical order according to the syllables, start- 
ing from the beginning.” The subject seems to 
hope that something not presently obvious will 
appear as he tries different spatial arrangements 
of the cards. As he works at alphabetizing them, 
he says, “I’ve just noticed something that had not 
occurred to me before, that there are only four 
symbols.” In fact, of course, there are only three 
“Somehow,” he continues, “in the beginning | 
assumed there were going to be more. So, now I 
am arranging them first of all according to sylla- 
bles, that is, the syllable order and the first three 
syllables, and then according to the lengths of 
the messages.” Since the first two syllables are 
identical on all the cards that he has seen, any 


attempt to alphabetize must take into consideration 
at least the first three syllables of the message. 

The experimenter notes that the subject is be- 
ginning to combine his rules in an effort to find 
some new basis for classification. Like most sub- 
jects, this one began with relatively simple rules— 
counting and alphabetizing—ready-made operations 
he has learned in other contexts; he transfers 
them bodily to the present task. For some reason, 
however, the results do not satisfy him and he 
is now beginning to search for something else. 
As he collects more cards, iie wants to come to 
terms with them in a way that respects their 
peculiarities and particularities, not by some ready- 
made rule that might fit almost anything. 

As shown in the chart below, the next card is 
ZPNPNPNPNPZN. The subject studies this 
card in silence for almost 60 seconds, looking back 
and forth between it and other cards spread out 
on the table before him. Then he makes a new 
observation, “I notice now that when I look at 
these cards in categories that there are units within 
each card that seem to repeat between cards. 
Like, for instance, I have two cards here both of 
which start with ZIR PAG NEN and end in 
ZIR NEN, and in between they have the unit 
PAG NEN, but in slightly different numbers. I 
mean, one card will have more of these units than 
another card. It seems like relationships are 
starting to emerge.” 

The more complex rule that the subject has 
just discovered is one that the experimenter has 
watched many subjects come to; the experimenter 
calls such formulations “Initial-Middle-Final Rule.” 
Although the rule is here applied to only two 
cards to form a single category, it contains the 
germ of the filing system that this subject will 
use for all the remaining cards. Note that it is 
a rule designed to fit the present situation and to 
use the particular information provided by this 
set of messages; it is not a rule that would apply 
equally well to all imaginable sets of messages. 


The order in which the messages were presented 
to the subject was: 


1.ZPZZPN 
2.ZPNPNPNPNPNPNPZN 
3. ZPZN 
4.ZPZZZZZZZZZZZZZPN 
5. ZPNPZN 

6.ZPPN 
7.ZPNPNPNPNPZN 

8 ZPZZZZZZPN 

9. ZPZZZZZZZZZZZZZZPN 
10. ZPPPPNZNNNNNP 
11.ZPPPNZP 

12,.ZPNZNP 
13.ZPNPNPNPZN 
14.ZPPNZP 

5S. ZPZZZZZZZ2222NP 

16. ZNZNNNNP 
17,.ZPNPNPNPNPNPNPNPNPZN 
18. ZPZZZPN 
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19. ZPZZZZZPN 

200. ZPNPNPNPNPNPNPNPZN 
Mm: ZPPPPPPPPPPNZNNNP 
22. ZPZZZZZZZPN 

23. ZPZZZZZZZZZPN 
24.ZPNZNNP 

25. ZPNPNPNPNPNPZN 

26. ZPZN 

27. ZPNPNPNPNPNPNPNPNPNPZN 
28. ZNZP 

29. ZNZNP 

DOD: ZPPPPPNZP 


At this stage the subject has received seven 
cards and has arranged them into three sets as 
follows, which we shall label as A, B, C, for 
convenience : 

A. ZPZZPN 

ZPZZZZZZZZZZZZZPN 
B. ZPNPNPNPNPZN 
ZPNPNPNPNPNPNPZN 
CG... ZREN 
ZPZN 
The seventh card, ZPNPZN, is left isolated for 
the time being. 

The next card is ZPZZZZZZPN. “Again I see 
the same thing within cards,” the subject says, 
apparently with some satisfaction. “I see units 
repeating in the cards within each class. For 
example, there is an end unit, two end units, 
in fact, and the intermediate units are the same, 
just varying in number. In other words, I count 
the number of ZIRs in the middle unit in both 
cards. It may be five in one case and six in 
another, and this holds for all my cards, I think. 
In other words, in all cases the ends and the 
beginnings are the same. It is the middle that 
varies.” 

The next card is ZPZZZZZZZZZZZZZZPN. 
The subject says, “This classification seems to be 
working out very nicely, so I will continue.” 

The next card is ZPPPPNZNNNNNP. There 
is a long silent pause; clearly, something new has 
appeared. The experimenter reminds the subject 
to keep talking. 

“All right,” he says, “now we have another 
card. I am trying to see what category this will 
fit in. I am looking at the first three syllables 
and again I see the same pattern emerging, I 
think. I notice now, as a side observation, we 
seem to have only three basic categories. And 
each of these categories is divided into subclasses 
which consist of cards having the same beginnings 
and endings and the same intermediate units, vary- 
ing in number.” The subject places this card, 
ZPPPPNZNNNNNP, together with ZPPN and 
ZPZN. 

The next card is ZPPPNZP. “Again, the 
same pattern. I mean to collect more cards and 
see if it continues. And as soon as I get a few 
more cards to form my third category I am going 
to see if there are any more relationships observ- 
able between the major categories themselves.” 


The next card is ZPNZNP. “Oh,” says the 
subject, “now I notice that I made a mistake. 
I see that whereas I thought there were three 
categories I now see there are four. The end 
units of two of my cards in one of my classes is 
different.” He indicates Set C consisting of ZPPN, 
ZPZN, ZPPPNZP, and ZPPPPNZNNNNNP. 
“In fact, there are considerably more categories 
than I thought. My first category (Set A) is all 
right. The beginnings and ends are the same, and 
the intermediates are the same. Similarly, my second 
category (Set B) is all right. Just this one I was 
trying to make my third category seems to be 
wrong. It does not seem to follow the same pattern 
the other two do. I just assumed it did, but I see 
that it doesn’t. I could attempt to follow the same 
classification, that is, instead of having three cate- 
gories, I could have five or six. In other words, | 
will continue to see if a pattern emerges. It looks 
like I will continue with this new category. It 
appears, though, that I may have to discard my 
old standards, or criteria.” 

The subject has been given no hint as to how 
many categories there should be; concern with 
the number is his own attempt to define the terms 
of the task. The desire to have as few categories 
as the evidence will allow seems to be an important 
consideration governing many subjects’ satisfaction 
with their own efforts. Moreover, like most sub- 
jects, he wants to use the same rules for every 
category; it feels inconsistent, or at least untidy, 
to have a different description for each different 
set of messages. 

Note here, incidentally, that it is his third 
category with a double variation in the middle that 
is causing his troubles. 

At this stage the subject has received 12 cards 
and has arranged them as follows (though not 
necessarily in the same order within classes) : 

A. ZPZZPN 

ZPZZZZZZPN 
ZPZZZZZZZZZZZZZPN 
ZPZZZZZZZZZZZZZZPN 
B. ZPNPNPNPNPZN 
ZPNPNPNPNPNPNPZN 
C. ZPEN 
ZPPPNZP 
D. ZPNZNP 
ZPPPPNZNNNNNP 
Two cards, ZPZN and ZPNPZN, are left as 
odd cards, not yet assigned to any definite category 

The next card is ZPNPNPNPZN. The sub- 
ject says, “I just noticed something I hadn't 
noticed before: that all the cards begin with 
ZIR.” Has he rediscovered something he remarked 
on first after the fifth card? Or is he noting that 
his earlier generalization is still valid? He says, 
“There no longer seems to be the tendency to 
have the same end units.” Perhaps he means that 
the beginnings of the messages are still predictable, 
even though the endings are beginning to seem 
more variable. Unfortunately, it is not always 
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possible to understand what a subject means; t 
interrupt him by questions would interfere to 
much with the processes the experimenter is trying 
to observe. 

The next card is ZPPNZP. “I noticed that the 
card I just received is like the first card in one 
of my categories, the simplest card, only this has 
additional symbols at the end.” He places ZPPNZP 
together with ZPPN and ZPPPNZP. “This could 
not occur with any of my other cards,” he says 
“T will continue.” He means that the endings of 
two of his cards in a category could be different 

The next card is ZPZZZZZZZZZZNP. “Now 
this card fits into one of my earlier categories. | 
will put it there.” 

The next card is ZNZNNNNP. 
long pause. Then, “Now I am lost.” 

“Why are you lost?” 

“TI don’t see relationships occurring between thes: 
last five or six or seven or eight cards. 1 cannot 
see where IJ can fit them sensibly into the cate- 
gories. They don’t seem to be fitting into th 
same nice pattern that the earlier cards did.” He 
leaves this card unassigned to any of the groups 

The next card is ZPNPNPNPNPNPNPN- 
PNPZN. “This one fits into one of my earlier 
categories. What I would like to see... I would 
like to get more information in order to decide 
whether or not I have to abandon my older classi- 
fication. Maybe these cards that I have will fit 
better if a card could be used to establish a new 
category using the same criteria as before. It is 
not apparent, however, whether this will be useful 
in sorting these materials. We have sort of nev 
materials here. I 


There is 


The two categories that I have 
seem to be filling in very nicely; they seem to be 
complete series, whereas these other cards don’t 
fit anywhere, yet.” 

The cards that are giving our subject so much 
trouble all belong together; the confusion is 
arising because there are two variables in this 
set as opposed to only one variable in each 
Sets A and B. 

While the subject was talking he reached for 
another card, ZPZZZPN, which he placed in on 
of his earlier categories. 

The next card is ZPZZZZZPN. He greets 
like an old friend and disposes of it quickly. “Now 
this one, too, fits into my older category, so | will 
place it there.” 

The next card is ZPNPNPNPNPNPNPNPZN, 
another old friend. “And, similarly, this one fits in 
all right.” 

The next card is ZPPPPPPPPPPNZNNNP. 


I 





A stranger, alas. “Now, I have this card... . ] 
see another pattern emerging, I think. . . . There 
are ... now, let me think .. . yes, it’s just that 


I didn’t discover what the intermediate unit was 
in some of these, assuming it to be two syllables, 
like the more obvious one in the other set,” he 
says, comparing this card with the one just 
preceding it. “I see now that the end syllable 
consists of just one—I mean, the ending consists 


of just one syllable, and the intermediate ones are 
more than one syllable. I can form another clas- 
sification, using the beginning three symbols and 
ending in one symbol, at least for some of my 
cards. I can fit them into my classification. In 
other words,” he says, revealing once more his 
desire to find a consistent basis for the classifica- 
tion, “I have formed a third category which is 
similar to the other two larger categories I have 
(meaning Sets A and B). And I still have four 
cards left over.” He refers to ZPPN, ZPZN, 
ZPNPZN, and ZNZNNNNYP as the four that are 
left over; then, returns to his new, or third, 
category in search of a simple rule: “Now, let's 
see if we can see the same sort of relationship 
here. Ah, it seems to be emerging. There is an 
end unit which is constant and there seems to be 

middle unit which seems to be constant. So we 
can have two ends and a middle; then, in between 
ends and the middle, there are other units which 
are the same. The same unit is added, only just in 
different numbers. So we have the same sort of 
process as before, except a little more complex. 
Now the question is, how to classify these cards. 
We will make a category out of all the cards that 
have the same two end units and the middle unit 
and I will classify them in order of their relation- 
ships depending upon the number of units in the 
beginning, and then the number between the end 
and the middle. If there is one unit, it will be the 
first card; if there are three units, it will be 
the second card, assuming that there isn’t one 
with two units in between. And if I have two 
cards with the same number of units repeating in 
between the beginning and the middle units, then 
| will use the second side—f mean, the number of 
syllables that is between the middle and the end is 
my second way of classification. I will rearrange 
these now.” 

Notice how the subject distinguishes between 
variant and invariant units in the statement of his 
rules. The approach might now be characterized 
as Initial-~-Middle-y-Final, where x and y repre- 
sent the variable parts. And he also has the rule 
well enough in mind to use it for ordering the 
ards within each category. 

“When I come to rearrange these,” the subject 
says, “I see this isn’t the case.” 

“What isn’t the case?” 

“That the two ends and a middle unit are con- 
stant with all the cards, and there is an extra 
syllable. So, I am going to wait.” 

\t this stage the subject has received 21 cards 
and has rearranged them as follows (though not 
necessarily in the same order within classes) : 

A. ZPZZPN 

ZPZZZPN 

ZPZZZZZPN 
ZPZZZZZZPN 
ZPZZZZZZZZZZPN 
ZPZZZZZZZZZZZZZPN 
ZPZZZZZZZZZZZZZZPN 
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B. ZPNPNPNPZN 
ZPNPNPNPNPZN 
ZPNPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPNPNPZN 

C. ZPNZNP 
ZPPNZP 
ZPPNNPP 


ZPPPPNZNNNNNP 

ZPPPPPPPPPPNZNNNP 
Four cards, ZPPN, ZPZN, ZPNPZN, and 
ZNZNNNNP, remain unclassified. Although he 
is making progress, he does not feel completely 
confident about the third category. [At this point 
the reader should try, if he has not already, to 
fit the four odd cards into the existing categories ; 
he will discover that this subject, like the majority 
of subjects, finds it difficult to invent zero when 
he comes to count the number of times a unit 
recurs. | 

The next card is ZPZZZZZZZPN. “This one 
fits into one of my older classifications, and in my 
first big category.” 

The next card is ZPZZZZZZZZZPN. “And this 
one, too. I will put it there.” 

The next card is ZPNZNNP. “Now I notice 
these... here... no, I'll see if it fits into one 
of my categories and if not, I will put it with these 
other cards and see if I can establish a new 
category with them. Now, this last card seems 
to be similar to two of my cards that I had set 
aside.” He puts ZPNZNNP together with 
ZPPPPNZNNNNNP and ZPPPPPPPPPPNZ- 
NNNP. “They have the same beginning and the 
same end and pretty much the same middle unit, 
but not quite, however.” 

The next card is ZPNPNPNPNPNPZN. “This 
fits into one of my older categories.” 

The next card is ZPZN: it is the same as the 
third card he had received earlier [pause]. “This 
one I will place with that and that,” he indicates 
ZPNZNP and ZPNZNNP. “There’s . . . [a 
pause] it just occurred to me that, although there 
appears to be a relationship, it is probably only 
by virtue of the fact that there are only four 
symbols we are using.” The mistake persists; he 
still does not realize that, in fact, only three 
symbols are used. “However,” he continues, “there 
does appear that there is an order. I mean, if 
I were to divide it like I have two end units and 
middle units, it appears that there is another 
syllable appearing in between the beginning and 
the middle and it is always the same unit. It 
just varies in number, much like the other classi- 
fication. So now I will order those. As a matter 
of fact, I see that three of my cards fit very nicely, 
actually, there are four that fit in very nicely. 
Five . . . oh, no, this is a little different. The 
end unit is different, so I have now established 
the fourth category. And I still have two cards 
left over that I am not quite sure what to do with. 


Let me check now and see if they don’t fit 
somewhere . . . no, they don’t seem to fit.” 


The next card is ZPNPNPNPNPNPNPN- 


PNPNPZN. “This fits in very nicely with one 
of my earlier categories.” He disposes of it 
quickly. 


The next card is ZNZP. “This card does not 
seem to fit anywhere.” 

The next card is ZNZNP. He pairs it with 
ZNZNNNNP, saying, “Now I have two odd 
cards that seem to fit to form another category, 
using the same idea of picking out syllables inter- 
spaced in between, just repeating. There is just 
one thing I want to say pertaining to one unit. 

. So, I now have five categories and two extra 
cards which, temporarily, I will call separate 
categories.” 

The next card is ZPPPPPNZP. 


menter says, “This is your last card.” 


The experi- 


The subject says, “This one, I think, fits into 
one of my earlier categories. Now I want to 
look over these couple of categories and see how 
| stand on those. I want to make sure that I have 
really followed the rules I think I established.” 
“You can take all the time you want to,” says 
the experimenter, “but don’t forget to talk.” 
At this stage the subject has received 30 cards 
and has arranged them as follows (though not 
necessarily in this order within classes) : 
A. ZPPN 
ZPZZPN 
ZPZZZPN 
ZPZZZZZPN 
ZPZZZZZZPN 
ZPZZZZZZZPN 
ZPZZZZZZZZZPN 
ZPZZZZZZZZZZPN 
ZPZZZZZZZZZZZZZPN 
ZPZZZZZZZZZZZZZZPN 

B. ZPZN 
ZPNPZN 
ZPNPNPNPZN 
ZPNPNPNPNPZN 
ZPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPNPNPNPZN 

C. ZPNZNP 
ZPNZNNP 
ZPPPPNZNNNNNP 
ZPPPPPPPPPPNZNNNP 

D. ZPPNZP 
ZPPPNZP 
Zereeree 

E. ZNZNP 
ZNZNNNNP 

Two cards, ZNZP and ZPZN, remain unclassified 

“Okay,” the subject says. “Well, I'll start out 
with my biggest and main categories because the 
relationships are clear, so I’ll make sure that they 
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follow all the way through. Now, they all have 
the same beginning and end syllables. There is no 
question about that. And, similarly, the second 
and next-to-last symbol in all—the beginning and 
end units are similar in all cards. In my first 
category the intermediate symbols are always ZI Rs 
and they occur in varying numbers. Let’s see, 
one, two, three, skip some; I can order thes« 
according to the number of ZIRs in between. | 
don’t suppose it matters as long as I am classify- 
ing rather than trying to interpret the information 
i can’t seem to decide between these: they seem 
to be homogeneous.” 

He shifts to Set B. “T’ll take the next one 
Again, we have two syllables that are the sai 
[ mean, they are the same from card to card, 
beginnings and ends. And we have in between 
them, rather than one syllable repeating, a pair of 
syllables repeating . . . and always ending in the 
same one. So I order these according to the 
number of insertions.” 

He shifts to the Set C. “Now I am going t 
look at one of the more complicated categories 
Here it seems there’s a beginning unit that consists 
of two syllables on each of the two cards and the 
end unit which consists of only one syllable” [short 
pause]. “In addition, there is a middle unit that 
consists of NEN ZIR, or whatever it is. And 
in between there are varying numbers of syllables 
between the beginning units and the middle unit 
In this category there are varying numbers of 
syllables PAG, so I am going to arrange them in 
this order. This will be my primary ordering 
process. And, should there be duplication in the 
number of PAGs, then I would check the number 
of symbols on the other side of the middle part, 
and order them according to their number” [pause] 
“T see that this does occur . . . so, they are ordered 
first with respect to the symbols between the 
beginning and the middle, and then with respect 
to the symbols between the middle and the end 
Should there be any ambiguity in the first order 

. it appears like a very nice... .”, He combines 
his next three categories into one on the basis 
of the rule just stated; the result is a_ third 
category: 

C. ZNZNP 

ZNZNNNNP 

ZPNZNP 

ZPNZNNP 

ZPPNZP 

ZPPPNZP 

ZPPPPNZNNNNNP 

ZPPPPPNZP 

ZPPPPPPPPPPNZNNNP 
“Now,” he continues, “I have the cards in three 
main categories, and I'll see if they will hold up 
Again, they begin with, and end with, the same 
unit; and the same pattern of middle syllables 
repeats.” 

He shifts to the remaining odd cards, ZNZP 
and ZPZN. “Now let’s see if my last category. 

I’m not sure what to do with that—mak« 
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sure if it doesn’t fit into one of the old ones... 
it doesn’t as such .. .” [long pause]. I’m glancing 
over all the categories now to make sure that this 
last category doesn’t fit into a subset of those” 
|pauses while looking]. “I think. ...” He picks 
up ZPZN. “I now notice that one of these cards 
does fit in to the, ah, my earlier category. 

It’s curious, I didn’t notice it before, but it seems 
to be identical with the first one in one of my 
categories. Well, I’ll place it there.” 

There is a long pause while he considers the 
remaining odd card, ZNZP. “It seems a shame 
to have a one-card category. I’ll look over once 
more to see if we can find a place to put it. . 
this odd card.” 

\fter a little more search he gives up. “I will 
leave it as it is,” he says, “I am happy with my 
classification.” 

The above technique was abandoned in 
all the studies to be described below in 
favor of a simpler and quicker one: all the 
cards were simultaneously presented to the 
subject and no running protocol was re- 
corded as the subject sorted the strings 
Comments the subject wished to make or. 
his method of sorting were recorded only 
at the end of the experimental session. 


THE PROBLEM 


This account of a subject’s responses to 
a problem solving situation has been recon- 
structed from a tape recording. The sub- 
ject is confronted with the problem of 
processing information which he received 
a little at a time. We can think of the sub- 
ject as starting the task in total darkness, 
for he has almost no idea what kind of in- 
formation is encoded in the messages. What- 
ever he does with the messages is entirely 
a matter of his own decisions. 

After watching a dozen or more subjects 
perform in a similar manner, one is im- 
pressed with several things. The first is 
the similarity between this laboratory epi- 
sode and ordinary behavior shown in deal- 
ing with our daily environment. All day 
long stimuli impinge upon an organism in 
a haphazard array; unless he can rapidly 
sort and file information in the right com- 
partments his life will quickly revert to 
chaos—to the “buzzing, blooming con- 
fusion” of the newborn child. Man orders 
his environment according to rules that he 
develops for himself or which he learns 
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from others; by this recoding technique he 
can economically handle the multiplicity 
and the variability of his environment for 
present and future use. 

A second impression one gathers is that 
men have learned to behave in this orderly 
manner without any specific instructions and 
without any knowledge of results provided 
by a teacher or an experimenter. Man 
seems to respect structure in the world 
wherever he finds it, regardless of conse- 
quences. It does not seem necessary here 
to invoke any hedonic contingencies—un- 
less perhaps orderliness carries its own 
rewards: a satisfaction with the goodness 
of fit. The subject is uneasy when a string 
of symbols does not fit one of the categories 
he has evolved. What one means by the fit 
here is more subtle, and the lack of fit less 
obvious, than in a mechanical or a picture 
puzzle; without any outside guidance or 
control the subject moves from a search 
for isolated perceptual relations into a 
search for the conceptual pattern under- 
lying the stimuli that he receives. 

The behavior of our subject emboldens 
us to argue for an implicit conception of 
structure in man. The argument is, of 
course, not new to psychology. It was first 
brought into lively focus by the Gestalt 
psychologists as part of their protest 
against the Wundtian tradition of elemen- 
tism and against the behavioristic denuncia- 
tion of everything mentalistic. The Gestalt 
psychologists did not share with Behavi- 
orism its contempt for mentalistic concepts, 
nor its distaste for introspection. On the 
contrary, they were against any oversimpli- 
fied mechanistic view of man; they urged 
the importance of value in a world of facts 
(K6hler, 1938). Meaningful relations 
(Kohler, 1929; Wertheimer, 1945), organ- 
ization (Kohler, 1929), fitness of parts into 
an organic whole (Koffka, 1935), required- 
ness (K6hler, 1938)—these were some of 
the values they wished to restore to a world 
hacked up into elements. 

The emphasis on relations is not a novel 
thought, either. Even Thomas Brown 
(cited in Woodworth, 1948) early in the 
nineteenth century, and von FEhrenfels 
(1937) later in that century, had stressed 


MN 


the importance of relations. But the notable 
contribution of the Gestalt school was to 
demonstrate the crucial role of structure, 
or pattern, both in the stimulus configura- 
tion and in the responses of the observer. 
According to them the perceptual process 
is essentially one of organization. The brain 
is a dynamic neural center where perceptual 
objects combine or separate on the basis 
of similarity, contiguity, symmetry, com- 
mon fate. K6hler’s conclusion (1947) that 
an “organism responds to the pattern of 
stimuli to which it is exposed” (p. 103) 
seems to fit the illustrative account of our 
subject’s responses to a decision making 
task. We cannot help but notice how often 
the subject is concerned with the “fit” of 
the messages into some category he had 
established: he is continually seeking “‘re- 
lationships,” he talks about “pattern,” he 
is trying to fit the parts into a total scheme 
he has set up to solve his problem. In fact, 
Wertheimer (1945) might have been de- 
scribing our own subject instead of the 
children who solved geometry problems, 
when he wrote: “In all these cases we have 
structural changes, tendencies toward the 
better structure, toward fitting, with the 
disappearance of disturbances” (p. 57). 
Since the notion of structure, or pattern, 
as proposed by the Gestalt psychologists, 
seems essential for the description of our 
subject’s behavior, it becomes the starting 
point of our inquiry. The notion of “pat- 
tern,’ however, as we use it here, is some- 
thing of a special case of the Gestalt notion. 
We are interested in what Miller and 
Chomsky (1958) have called “pattern con- 
ception.” For the purpose of the present 
experiments we define pattern as the invari- 
ance of a sequence of symbols in the con- 
text of lawful change, and we simplify its 
meaning by defining pattern in terms of 
internalized rules, or relations, a subject 
employs to organize novel information. 
Rules usually imply a notion of abstraction 
and generalization; they go beyond the in- 
formation given. They cannot be said to 
be covered either by the transposition ex- 
periments such as those of Kohler (cited 
in Osgood, 1956, p. 495), nor by the studies 
of stimulus generalization conducted by the 
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Behaviorists (Hovland, 1937a, 1937b; 
Razran, 1940, 1949). In both cases the re- 
sults could be explained on the basis of 
sensory relations alone. By the use of rules 
the subject searches for pattern in order to 
recode the sensory input into manipulable, 
meaningful units. Perhaps a limitation on 
the information capacity of the human 
channel makes such behavior (recoding) 
necessary. In any case, it is our wish to 
find out something about these rules or pat 
terns by which the subject handles informa 
tion. We would like, therefore, to make a 
systematic study of some of the experimen- 
tal variables that affect the nature of the 
rules_ that employ in 
search for pattern in the stimulus input. 
One could begin a study of rules by ask 
ing the usual questions of What? How? 
and Why? Hull (1920) is one among those 
researchers who tackled the question What! 
“What does a child learn,” he 
“when he calls a certain object dog?” This 
question is part of a broader question: 
What does any individual learn when he is 
said to have learned a concept? To find an 
answer Hull required adult human sub- 
jects to learn some Chinese sounds paired 
with certain characters in the 
alphabet. Hull’s conclusion was that what 
an individual learns is a rule about the 
common characteristics of a class of ob 
jects. He placed the emphasis on the com 


order t¢ 


subjects 


asked, 


Chinese 


mon characteristics of the stimuli; the rule, 
Hull equated with discrimination learning. 
It is, therefore, not clear to the reader 
whether Hull’s subjects learn a rule or 
whether they merely learn a name for the 
rule. 

A second answer to the question What? 
\ccord- 


ing to a second view what the subject really 


is contained in the word relations. 


learns is some kind of a rule of relations by 
which he groups instances as members of a 
class. A search for common relations among 
instances might take the subject beyond 
the common perceptual elements to some 
thing more. For the Gestalt thinkers this 
something more is the “organic whole” which 
is more than the sum of its parts. Goldstein 
and Scheerer (1941) describe it as “ab 
straction.” According to them the process 


of abstration is different from that of ag- 
gregation of common elements under one 
label. Smoke (1932) discusses this same 
idea as symbolic reaction. Our own ap- 
proach to rules is in terms of relations and 
we view rules of relations as pattern con- 
ception. 

The question—how rules are learned or 
made—has been ingeniously handled by 
Bruner, Goodnow, and Austin (1956), and 
recently by David Reiss (1958). 
Bruner et al. systematically investigated the 
methods by which subjects categorize given 
events. The categories into which events 
are to be placed are predetermined by the 
experimenter’s rules. It is the task of the 
subject to discover the experimenter’s rules 
for placing the given events into their 
proper boxes. In order to categorize suc- 
cessfully the subject uses “strategies,” or 
plans, by which he searches for informa- 
tion which would be useful to him in con- 
structing his hypotheses about how the 
events are distributed into the experimen- 
ter’s boxes. This stepwise process—of 
strategy construction, searching for infor- 
mation, hypothesis building, testing hypoth- 
eses—the authors describe as the process 
of concept attainment. Of these steps, the 
one of strategy construction is the most 
Bruner et al. (1956) describe 
strategy as “regularities in  decision- 
making,” and define it as “a pattern of de- 
cisions in the acquisition, retention, and 
utilization of information that serves cer- 
tain objectives (p. 54). A “host of factors” 
influence the construction of strategies in 
concept attainment, according to these re- 
searchers. 

In his discussion of the construction of 
strategies by subjects, Reiss takes Bruner’s 
“host of factors” for granted, and he pro- 
ceeds to study how a subject constructs his 
“model” of what the rules that govern the 
stimuli might be, but Reiss studies this 
without constraining the subject strictly by 
the experimenter’s rules. He varies the 
quantity of evidence given to the subject 
initially, and then observes how this amount 
of information affects the subject’s be- 
havior in model attainment. Reiss finds 
that the subject’s model is in fact seriously 


more 


critical. 
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influenced by the amount of information 
initially presented to the subject. 

In all the studies discussed above, except 
the last one, the experiment concludes when 
the subject reaches the solution as the ex- 
perimeter stipulates it. Hull’s subjects 
continue to learn the list until the criterion 
is reached. The criterion, of course, means 
the one that the experimenter has set up. 
Hanfmann and Kasanin (1937) turn over 
a block or two to suggest to their subject 
the “correct” grouping which he must dis- 
cover. Bruner, Goodnow, and Austin con- 
firm and infirm hypotheses. In none of 
these studies is the subject ultimately free 
to set up a criterion independently of the 
experimenter. Obviously then, the question 
arises whether the subject will proceed to 
work by rules even when he is free from 
the experimenter’s criterional constraints. 

Thorndike (1931) asked subjects to 
draw lines equal to a specified length with- 
out giving them any information about the 
accuracy of their results. Yet one notes in 
his results that the performance of sub- 
jects began to stabilize around some stand- 
ard implicit in the subjects’ behavior. In 
other words, the subject seems to be 
settling on some rule which he bases upon 
his own performance. It is immaterial 
whether he does this consciously or uncon- 
sciously. 

One notices a similar reliance upon a 
subjective standard in Reiss’ subjects as 
they construct models. The “random” 
group, whose hypotheses were confirmed 
or infirmed according to a prearranged ran- 
dom schedule, did not perform randomly 
at all. Instead, subjects constructed models 
quite early and began to test these on the 
basis of their own hypotheses, disregarding 
the information they received from the ex- 
perimenter. 

Systematic responses, which may be 
abruptly shifted to new modes of syste- 
matic behavior have been observed by 
Krechevsky (1932), Witkin (1940), and 
Spence (1939) in animal subjects. By 
means of a four-point discrimination ap- 
paratus, Krechevsky presented an “in- 
soluble” problem to his rats. These sub- 


jects did not run around the apparatus in 
random fashion, however; they developed 
systematic modes of responses. Later, Wit- 
kin, substantiated these results on animals 
run in nonspecific problem situations, but 
his explanation was different. Spence ob- 
served the same phenomenon in chimpan- 
zees when he presented them with an 
insoluble problem in a multiple-choice ap 
paratus. 

We have mentioned these studies for 
three reasons: first, because all of them 
employ a notion of a criterion of success 
which is externally determined; second, 
that this notion of an externally imposed 
criterion introduces some contingencies of 
reinforcement for the subject; and third, 
that when the contingencies of reinforce- 
ment occur haphazardly, the subject disre- 
gards these and begins to develop some 
stable modes of responses on the basis of 
his own performance. 

All of these observations influence the 
design of our research; we deliberately take 
our point of departure from the first two 
and confine our interest to the third. We 
stated earlier that our interest is in rules. 
What kinds of rules does a subject make? 
How are his rules different from the ex- 
perimenter’s rules? Do these rules depend 
upon the nature of the stimuli, upon a 
knowledge of results, or upon the subject 
himself, or on a combination of these fac- 
tors? 

These and a host of other questions 
clamor for recognition and investigation. 
Twentieth century psychology has not been 
indifferent to this need. We have already 
seen how some of the earlier studies, 
characterized as dealing with concept for- 
mation, were investigating rules of induc- 
tion. Within the same framework, but one 
step further, Heidbreder (1946) introduced 
the idea of hierarchy in rules. The notion 
of hierarchy of rules implies the notion of 
complexity of strategies that subjects con- 
struct. In later studies the researchers were 
interested in the effect of rules upon learn- 
ing and recall. For example, Miller and 
Selfridge (1950) tested their subjects on 
memory of passages which were graded ac- 
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cording to their statistical approximations to 
English from zero to high order. The bet- 
ter recall of meaningful materials, the au 
thors explained on the basis of the sub- 
ject’s familiarity with the rules that 
governed the sequential properties of the 
meaningful materials, not the fact of mean- 
ing itself. 

Miller deals with the 
of rules of formation upon learning and 
recall in a later study (1958) in which he 
compares the free recall of redundant and 
random strings. Again he finds that even 
when the subject is unaware of the rules of 
formation that govern the structure of re 
dundant does better in 
memorizing the redundant strings. 
to this study of Miller’s, some experiments 
had been done by Aborn and Rubenstein 
(1952; Rubenstein & Aborn, 1954) to study 
the effect of rules of contextual constraints 


facilitatiing effect 


strings he still 


Prior 


upon learning sequences of nonsense syl 
lables. Test passages were developed by 
progressively limiting the context in which 


Thus 


the rules ranged from purely random 


the nonsense syllables could appear. 
highly redundant. These researchers found 


that when subjects were made familiar witl 
f formation, redundant se 


the rules ¢ 
quences were somewhat easier to learn, but 
the amount of information contained in the 


recalled materials was not constant from 
passage to passage. 
So far the rules we have referred to 


have been simple rules; they pertain to twé 
kinds of stimuli—unit stimuli and sequen 
tial stimuli—both of which have been gen 
erated with some kinds of rules of forma 
is a good example of the 


Miller’s rules, as 


tion. Language 


second type of stimuli. 


well as those of Aborn and Rubenstein, and 
of Reiss, deal w ith rules of this type, that 
is, with rules of formation that govern 


sequential dependencies. 

Rules that govern sequential properties 
of a language also have important effects 
upon its statistical (Miller, 
1951). Shannon (1949) analyzes the se 
quential properties of language by proba- 
bility rules, a technique that provides access 
to information measurements as well as to 
His rules deal with the selec- 


properties 


organization. 


IPSTONE 


tion of a message from a determinable 
number of alternative messages that could 
occur, and with the probability of its occur- 
rence. His rules deal with the amount of 
information contained in a message. Since 
Shannon’s concern is primarily with the 
communication of a message and not with 
its use, value, storage, meaning, history, or 
anything else, his rules are not of direct in- 
terest to us in the present context. His 
terms, however, and his techniques for 
measuring the amount of information 
transmitted through a noisy channel pro- 
vide the psychologist with some useful 
tools of analysis which information theor- 
ists (Attneave, 1959; Hovland, 1952; Luce, 
1954; Miller, 1953; Quastler, 1955) have 
applied for measuring organization. We, 
too, shall draw upon these aids. 

In contrast to Shannon, the communica- 
tion engineer, we can place Chomsky 
(1957), who looks at language as a gram- 
marian. His rules are more general and 
more formal than those of Aborn and 
Rubenstein, though not more difficult; he 
refers to his rules as “finite state gram- 
mars.” These simpler grammars enable us 
to avoid the complications of natural lan- 
guages and to work with.simpler structures. 
But although his notion of rules is formally 
simple, it raises a few complex questions 
regarding relation between rules and per- 
formance. Miller (1958) points to this 
problem and he would “like to know what 
changes, if any, in a set of rules leave per- 
formance invariant,” (p. 485). And so 
would we. 

Simple rules of formation of sequential 
stimuli do not always produce simple effects 
upon responses. Some rules facilitate per- 
formance (Anderson & Leonard, 1958; 
Fitts, Weinstein, Rappaport, Anderson, & 


Leonard, 1956; Miller, 1951) and some 
rules hinder (Anderson & Leonard, 
1958). It seems necessary, therefore, to 


search for parameters that affect the con- 
struction and use of rules of formation. 
It is the purpose of this study to explore 
some specific experimental variables im- 
plicit in these rules that influence the pat- 
terning responses of subjects. 
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The role of specific stimulus features in 
pattern perception has been previously 
studied by the Gestalt group, by Attneave 
(1954, 1955), by Fitts and his colleagues 
(1956), and by others (French, 1953, 1954b, 
1954c). Gestalt psychologists analyzed the 
peripheral properties in visual organization 
as proximity, similarity, continuity, and 
closed figure, but the process of organiza- 
tion itself could not, according to them, be 
understood without the field concept. They 
postulate an isomorphism between visual 
organization and the brain processes. Ac- 
cording to the principle of isomorphism the 
environmental field tends to become “psy- 
chologically organized” in terms of regular- 
ity, symmetry, and simplicity. Attneave 
translated and quantified the Gestalt fac- 
tors of organization in terms of informa- 
tion measure of redundancy in the visual 
input. He measured memory for simple dot 
patterns on recall. He found that good 
form, redundancy, and noise were some of 
the experimentally accessible variables that 
affect pattern perception. Fitts et al. at 
Ohio State University have done special 
work with redundant forms in visual pat- 
tern recognition. They developed metric 
figures of random and constrained type and 
they measured the amount of redundancy 
contained in these figures. They studied 
the correlation of a measured amount of 
redundancy with recognition of visual 
forms. Some additional variables, such as 
language habits and context (Miller, 1951), 
also have been found to affect the proba- 
bility of response. Carmichael, Hogan, and 
Walter (1932), for instance, demonstrated 
the influence of instruction upon response. 
Although we shall give a different inter- 
pretation of their results later, we shall not 
ignore the importance of instruction as a 
possible source of variation in performance. 

All the studies mentioned above suggest 
that a quantitative analysis of the experi- 
mental variables in pattern conception is 
possible. These studies also help to separate 
some parameters of pattern conception, par- 
ticularly those concerned with the number 
of different alternatives used and with 
structural similarity among the sequences. 
The number of different alternatives has 


proved to be an interesting parameter in 
studies in perception (Attneave, 1954) and 
in memory (Brogden & Schmidt, 1954a, 
1954b). Whether it is an important vari- 
able in pattern conception as well is the 
question we raise. Does a larger vocabulary 
help with the better ordering of the se- 
quences of symbols or not? If not, why 
not? One might expect the facilitatory 
effect of the larger number of different 
alternatives to reach an asymptotic value 
at some point. 

Structural similarity is another para- 
meter that must be explored. Its impor- 
tance in visual pattern recognition (I*rench, 
1954a, 1954b), and in learning and re 
call (Aborn & Rubenstein, 1952; Miller, 
1958; Rubenstein & Aborn, 1954) has 
been demonstrated. Does it also affect pat 
tern conception? The notion of structural 
similarity as used in this study should not 
be confused with Shannon’s notion of re 
dundancy. The languages we are working 
with are, in a sense, “degenerate” because 
in the limit their redundancy is 100%. We 
define structural similarity as similarity in 
rules of formation and specify it in terms 
of the number of basic strings as well as 
the number and placement of recursive 
loops within each string. 


Materials and Procedure 


Miller and Chomsky (1958) have pointed 
up some of the difficulties of method and 
material in laboratory studies of concept 
formation. In the earlier concept forma- 
tion studies the subject was constrained by 
the nature of the task to find the rules 
as the experimenter defined them. Further- 
more, the use of geometrical patterns differ- 
ing in size, shape, and color effected a 
certain stereotypy in material and usually 
limited them to a finite number of possible 
alternatives from which the positive in- 
stances were to be selected. Miller and 
Chomsky proposed the use of a new kind 
of material in the study of pattern con 
ception, viewing it as a different kind of 
task: instead of spatial patterns, they pro- 
posed to study sequential patterns. By using 
finite state systems for materials one could 








study pattern conception in a situation 
“modelled after the task of a grammarian.” 

A finite state language is an artificial 
language consisting of the set of well- 
formed strings (sentences) that can be 
generated by a set of simple rules (see 
Chomsky & Miller, 1958). This set of rules 
is a kind of algebra, or grammar, for iden- 
tifying the grammatical sentences in a par- 
ticular language. A finite number of sym 
bols makes up the vocabulary of a language. 
A finite state grammar consists of a finite 
number of states (S,, S,, S,,...) and rules 
for moving from one state to another. A 
state is defined as the set of all rules appli- 
cable to the state. Starting with any given 
state 0 there is a transition to a next state 
1 via a symbol. Thus, for example, 
i a> S; which can be written (0, a, 7); 
this is one rule of transition of the gram 
mar. A rule of a grammar determines 
whether there is a direct transition from 
the present state to one or more other 
states, or to the same state itself in the 
form of a recursive loop.” Figure | is a 
graphical representation of the finite state 
generator used in the introductory Illus 
trative Study. The numbered circles repre 
sent the states of the system and the arrows 
represent the rules of transition permitted 
from one state to the next. Irom S.,, for 
example, the grammar permits us to move 
to S,, then again to S.,, thus: 
a 


back 
This loop can be traversed any 
The nonsense words as 

arrows 


number of times. 

sociated with the 
finite set of symbols that comprise the given 
vocabulary of the language. This par- 
ticular finite state system has a vocabulary 
of three words—NEN, PAG, and ZIR, 


represent the 


2A recursive loop in a string is the unit that 
can occur in the string from zero to any number 
of times. Expressed graphically, a loop in the fol 


a 
lowing string S,—»S, —>SsIy—>s; occurs at S 
a b c 


and is indicated by the looping arrow a. A 
string could have any number of loops at any of 
the states. Each of the grammars in Experiments 
III and IV contains a double loop in one of its 
strings. 
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which could be abbreviated as N, P, and Z 
for convenience. One of these words is 
generated each time we move from one 
state to the next. For example, in S, the 
rules say either move to S, by generating 
the word PAG, or make a recursive loop 
by returning to the same state itself by 
generating ZIR any number of times. 


Upon the first visit to Sy a sentence is 
completed; a sentence is any well-formed 
string. A well-formed string is generated 
by always starting in S, and following any 
route in the diagram and ending in the first 
visit to S,-. The 


well-formed — strings 





RULES OF TRANSITiuWN 








(0,Z,!) (1,P,3) (3,Z,3) (5,P,2) (7,Z,8) 
(0,Z,6) (2,N,5) (3,P,4) (6,P,6) (8,N,8) 
(1,P,2) (2,Z,4) (4,N,0) (6,N,7) (8 ,P,0) 
BASIC STRINGS CANONICAL NOTATION 
ZPPN ZP (Z) PN 
ZPZN ZP (NP) ZN 
ZNZP Z(P)NZ(N)P 


Fic. 1. A graphical representation of the finite 
state generator of sentences used in the Illustrative 
Study. (An admissible string is any sequence of 
words generated by starting in state 0 and ending 
on the first visit to 0’. All admissible strings can 
be expressed by means of the canonical notation 
given below the figure. Loops have been omitted 
in the basic strings. The rules of transition given 
beneath the diagram comprise an equivalent de- 
scription of the generators. Each rule defines the 
state in which it is applicable, the symbol it gener- 
ates, and the next state in which the rule leaves 
the system.) 
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selected by the generator in Figure 1 are: 
ZPZN, ZPPN, ZNZP, ZPZPN, ZPNZP, 
ZNZNP, ZPNPZN, ZPNZNP, etc. Of 
these well-formed strings the first three we 
consider as basic strings. The important 
feature defining a basic string is that no 
state is visited more than once. When loops 
are inserted, states are revisited; basic 
strings with their loops we are naming 
formulas. Basic strings are significant be- 
cause, as Chomsky and Miller have shown, 
they provide the pattern for all the longer 
sentences in the languages; in grammatical 
terms, they are the linguistic “frames” into 
which the elements must fit. Any number 
of different sentences can be generated by 
following the rules and using the recur- 
sive loops any number of times; if we in- 
dicate loops by parentheses, however, all 
the sentences in the. System shown in 
Figure 1 can be represented by a canonical 
notation of the general formulae: 
ZP(Z)PN 
ZP(NP)ZN 
Z(P)NZ(N)P 

At the bottom of Figure 1 are stated the 
rules of transition, the basic strings, and 
the canonical notation for the formulas. 
For example, (0, Z, 1) is a rule of tran- 
sition; it means that when the system is 
in S, it can generate the symbol Z and 
thus move to S,. One direct path traced 
by a set of rules of transition, without 
loops, is a basic string: with loops it is a 
forma. Toes. vo, 2, 1), (1, P, 2) (2; 2, 
4), (4, N, 0’) yield the basic string 
ZPZN ; by adding the following two rules, 
(2, N, 5) and (5, P, 2), a double loop has 
been added to the basic string which gives 
the formula ZP(NP)ZN. 

Finite state systems provide attractive 
materials for experimental studies in psy- 
chology. Their formal simplicity and psy- 
chological complexity make them potentially 
useful for the study of higher mental 
processes. Moreover, the pattern and con- 
struction of finite state systems lend them- 
selves to quantitative handling by informa- 
tion measures. These materials are a new, 
useful, and quantitative tool to apply to the 
study of memory and thinking. For these 


reasons we choose to use finite state sys- 
tems as materials for exploring pattern con- 
ception. From a finite sample of sentences, 
subjects are expected to infer the under- 
lying grammar. To what extent controlled 
variation in the formal structure of these 
systems may affect the behavior of the 
subject broadly defines the scope of the 
present research. 

At the beginning of an experimental session a 
finite set of sentences from a particular language 
is presented to a subject in a rectangular array of 
cards on a table surface. The cards have been 
previously arranged by tables of random numbers 
and are all exposed simultaneously before the 
subject. (This procedure of presenting all the 
cards to the subject at once is different from the 
procedure followed in the Illustrative Study.) Each 
3” X 5” card contains one string of words only; 
the string is typed in capital letters on a single 
line. The subject sits facing the cards; he reads 
the typed instructions, then repeats them to the 
experimenter in his own words. 

Neutral instructions: “Here is an array of cards 
to be sorted into groups, into as many or as few 
groups as you wish to make and for whatever 
reason you wish to make them. When you are 
satisfied with your groups inform the experi- 
menter.” 

If a subject has any questions regarding the 
time limit he is informed that there is none. Other 
questions are answered by merely referring him 
to parts of the typed instructions. When a subject 
begins to study the cards the experimenter starts 
a timer manually; the experimenter stops the 
timer as soon as the subject is through with his 
sorting. After the subject has made his groups he 
is asked to generate two sample strings to match 
each of his groups and the experimenter records 
the results of the subject’s sortings along with 
any comments that the subject is willing to make 
about the way he performed the task. 

The following principles were implicit in the 
procedure: (a) the problem is presented in as 
unstructured a way as possible in order to give the 
subject freedom to structure the materials by his 
own rules, (b) the subject is free of time pres- 
sures, and (c) there is a three-way check (sorting, 
additional instances, and free comments) on 
whether or not the subject was using any con- 
sistent strategy in arranging the cards into cate- 
gories. 


Subjects 


Graduate and undergraduate students of Rad- 
cliffe College and Harvard University in the age 
group between 18 and 35 years, irrespective of 
field of specialization, were run as paid subjects. 
No subject served twice in any part of the experi- 
ment. 
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An ExpLoratory STUDY 
(Experiment IT) 


This experiment was a pilot study (a) to 
explore the kinds of strategies subjects use 
in grouping a given set of strings generated 
by finite state grammars (if, indeed, they 
used any strategy at all), and (b) to de 
fine some quantitative methods to measure 
the amount of in each system of 
categories developed 


“pattern” 
a subject. 


k, tralegics 


Definition. A strategy may be qualitatively de 
fined as the method or rule used by a subject for 
grouping a number of given strings int 
Operationally, a strategy is determined 
specific sentences that a subject puts into a subset 
A specific strategy equates with a specific pattern 
a subject uses for making his groups. (This use 
of the term is different from the way 
defines it—Bruner et al., 1956, p. 54) 

Quality. A strategy may be successful or Icss 
successful. It is considered successful if it co1 
responds completely with the basic strings of th 


subsets 


by the 


Brune: 


experimenter’s grammar. It is less successful i 
it deviates from the experimenter’s rules. The 
degree of correspondence between the experi- 
menter’s rules and a subject’s rules determines thi 


No strategy used 


by a subject is labeled as completely unsuccessful 


degree of the subject’s success. 


Procedure 





The materials and procedure followed in this 
study have been described above; they are essen 
tially the same in all the studies described below 
In this exploratory study nine different finite sta 
systems were set up in a 3 X 3 design; three 
different generators (grammar) of three, four, « 
five basic strings (see diagrams in Figures 2a, 2b, 
and 2c) were combined in all possible ways with 


three, six, or 


three sizes of vocabulary, either of 
nine words. Nonsense words were used in all 
languages in order to minimize possible associa 
A tenth languag« 


strings whicl 





tions with the stimulus sequences. 
consisted of sequences of random 
were generated with the help of tables of randor 
numbers. A vocabulary of three words was used 
for this set 

The langu e constructed with the follow- 
ing principles in mind: (a) Grammar III (five 
basic strings) is a hybrid composition from Gram- 
mars I and II. (+b) The grammars are constructed 
to facilitate the experimenter in detecting objec- 
tively the strategy used by a subject. This prin- 
ciple places a constraint of congruence of certain 
parts—initial and/or final—in the structure of the 
basic strings. (c) The placement of symbols on 
transitional arrows is determined partly by the 


ges wel 


SHIPSTONE 


reason given previot ty, partly by haphazard as- 
signment, (d) The namber of instances (samples) 
in all ensembles in this study is 16. This particular 
design of three types of grammar and three sizes 
of vocabulary was used to include strategy con- 
struction under a variety of conditions to be tested 
more specifically later. The random strings con- 
dition was designed to serve as control for the 
nonrandom conditions. The nonrandom conditions 
are the nine grammar groups. 

Sixty subjects were run individually, six sub- 
jects per condition. Subjects were distributed one 
per condition in the order in which they came. 


Analysis of Strategies 


Qualitative Analysis 


There were 16 instances of sentences from cach 
of the languages; the cards were numbered on 
the backs from 1 to 16. Each subject’s sorting 
was recorded in terms of these numbers. The 
grouping produced by a subject was then com- 
pared with the various groupings that would 
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Fic, 2. Finite state generators for sentences 
used in the Exploratory Study (Experiment IT). 
(The small letters on arrows indicate similarities 
and differences in the initial, final, and recursive- 
loop positions of symbols in the sentences gen- 


erated). 
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result from following the different strategies. 
Thus, if the subject’s groups were (3, 7, 15, 16) 
(1, 5, 10) (11, 12) (2, 6, 13, 14) (4) (8) (9), 
they corresponded with groups that resulted if the 
Length strategy were followed. Length strategy 
refers to grouping the instances by the number of 
words—the length of the sentence—contained on 
each card. The experimenter had worked out 
various groupings in terms of possible strategies 
so that a strategy could be objectively identified. 
Of course, the matching samples generated by 
a subject for each of his groups and the protocol 
were additional checks upon the identification of 
the subject’s strategy. 

Out of the billions of possible partitions of the 
16 instances only about 20 strategies, not always 
markedly different from each other, were found 
to be used by subjects. The five most common 
strategies were labeled as follows: 

Length: Subject puts together instances having 
the same number of words. 

Alphabet: Subject alphabetizes the whole set 
of instances, treating it as a single category. 

Initial : Subject sorts by similarity found in the be- 
ginnings of sentences. This similarity could range 
anywhere from one to four words 

Initial-Final: Subject sorts by the identicalness 
both of the beginnings and the endings of sen- 
tences. 

Formula: This strategy approximates to the 
experimenter’s rules for the finite state generator. 

Miscellaneous: This category is a cover-all 
group for isolated cases of sorting such as sort- 
ing by a single striking word, or by the repetitive 
elements, or by rhythm, etc. 


Quantitative Techniques 


Since a variety of strategies was employed, 
strategies ranging from partially successful ones 
to the one that does succeed completely, it becomes 
necessary to order or scale the strategies. Earlier, 
we defined a successful strategy as one that 
completely corresponds with the basic strings of 
the experimenter’s grammar. Three measures for 
scoring strategies were employed: an agreement- 
disagreement measure, a rating scale, and a time 
score. The first two were constructed specifically 
for evaluating strategies in this research. 

Agreement-Disagreement Ratio (A-D ratio). 
This statistic is a kind of correlation technique 
for measuring the degree of agreement and dis- 
agreement between the experimenter’s grammar 
and a subject’s categories; or between any two 
subjects’ categories. The ratio expressed (roughly) 
by the observed number of agreements over the 
total number of possible agreements in grouping, 
vields a measure of correspondence. When the 
two groupings are identical, the A-D ratio equals 
unity (see Appendix A for details of this tech- 
nique). 

Rating Scale (R scale). Each subject’s strategy 
was assessed on a 10-point scale developed for 


this purpose. This scale was set up in the follow- 
ing way. Each subject’s performance was classi- 
fied into a strategy by the methods previously 
stated. Approximately 20 strategies seemed avail- 
able to the subjects. Identical randomized lists 
of these were presented to five judges for rating 
The judges were graduate students at Radcliffe 
College and Harvard University. They represented 
different fields of experience and cultural back- 
grounds. The judges were instructed jointly to 
rank on a 10-point scale the strategies subjects 
used. The instructions given to the judges were: 
You have been handed a list of labels for the 
strategies subjects used in an experiment in 
sorting into groups a pack of cards prepared 
by the experimenter. [Judges were shown the 
cards and the experiment was explained to them. 
Also, the labels of the strategies were described 
or explained.] You are being asked to rate 
these strategies by assigning them values be- 
tween 1 and 10. Rate the poorest strategy | 
and the best one 10. You may assign the same 
value to several strategies if you think they rate 
the same and you may indicate distance in the 
quality of strategies by the size of the interval 
you leave between any two. Any questions? 
After the questions were answered the judges 
worked independently. From the averages of the 
scores assigned by the judges to the strategies a 
standard 10-point rating scale for evaluating 
strategies was developed (see Appendix B for 
details). 

Time Measure. A record of the total time re- 
quired for sorting by each subject was made. 
Thus the number of seconds per word provided 
a third measure of a subject’s performance. 


Reliability of Measures 


The A-D Ratio. Although this statistic is a 
fair measure of success in terms of the cor- 
respondence of a subiect’s responses to the ex- 
perimenter’s rules for generating the strings, it 
is subject to experimental artifact. For example, 
the Initial, strategy (instances grouped together 
on the basis of the first word only) yields a 
spuriously high correlation score. The number 
of subjects using the Initial: strategy is com- 
paratively small, however, and the other two 
measures offer a second check on the results. 

A second difficulty with this statistic is that 
interlanguage comparisons cannot be made without 
using some kind of correction factor for the num- 
ber of instances in the set. The measure of agree- 
ment rises with the increase in the number of 
instances; the measure is not really comparable 
from one language system to another. 

The R Scale. The following table shows the 
reliability of the ratings of strategies made by five 
judges. Rank-difference correlations were used. 

Tt should be noted that the equality of intervals 
between ranks in this scale cannot be established. 
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TABLE 1 


RANK-ORDER CORRELATIONS BETWEEN JUDGES FOR 
RATING STRATEGIES 


Judges 1 z 3 j 
2 .89 
3 94 77 
4 .88 .89 .82 
5 91 .87 .89 . 88 


Note.—From the ratings assigned by these judges to strat 
egies was developed the 10-point rating scale. 


Time Score. Although time measures have bee 
commonly used in studies of visual pattern per 
ception (Adams, Fitts, Rappaport, & Weinstein, 
1954; Anderson & Leonard, 1958; Fitts et al., 
1956), in the present research they are only a 
rough measure. Several factors are simultaneous] 
operative which decrease the reliability of the tim 
scores. As the number of instances in the set 
increases, the time for sorting increases dispropor 
tionately, paralleling the rule that time increases 
according to the amount of information that must 
be processed. Both length and complexity of tas! 
tend to increase time, but if a subject works with 
the self-instruction of speed, he finishes the task 
much more quickly. The difficulty here is circular 
The less time a subject spends with the problem 
the less information he extracts from the input 
and the less time he spends dealing with his 
problem. 


Results 


The distribution of subjects in the Ran 
dom group is compared in Tigure 3a with 
two of the nine groups (J,3 and ITI,9) 
with respect to the kinds of strategies used 
in each. The bars for the Random group 
and the (ITT,9) 
and a vocabulary of nine words) are longe1 
at the Condition (1,3) 
(three basic strings and a vocabulary o 


group (five basic strings 


opposite ends. 


three words) has more affinity with the 
Random group than is apparent from the 
figure; three of its subjects who wer 
classified under the 
tion in the type of strategy used actually 
used some kind of differentiated counting 
technique that was slightly more complex 
, 


than the plain counting involved in_ the 
Figure 3b compares the 


Miscellaneous condi- 


Length strategy. 
same three groups by the average number 
of different categories made by each in the 


SHIPSTONE 


N:6 FOR EACH GROUP 


a (0 (1.3)- Group 
(II ,9)~ Grove 
GB Rr anoom - Grove 





INITIAL FINAL FORMULA misc. 
STRATEGY 


3a. Frequency comparison of strategies by sub- 
jects in the Random and two grammar groups 
The Random group worked with 16 strings that 
had been haphazardly generated. The (1,3) group 
worked with 16 well-formed strings that had 
been generated with three formulas and a vocabu- 
lary of three words. The (III,9) group worked 
with 16 well-formed strings that had been gener- 
ated with five formulas and a vocabulary of nine 
different words.) 


N°6 PER GRAMMAR CONDITION 
'6* NUMBER OF INSTANCES IN 
b EACH CONDITION. 
FILLED BARS REPRESENT NO. OF 
DIFFERENT CATEGORIES. 
SPACED BARS REPRESENT THE 
7 MEAN WO. OF CATEGORIES, 


[MennnE 


RANDOM 





TRING 


3b. Mean number of categories for subjects in 
the Random and two grammar groups. (Solid 
columns—first in each pair—compare the same 
three languages for the number of different cate- 
gories made. Different categories refer to instance 
categories, e.g., one-instance category, two-instance 
category. Broken bars compare the same three 
groups for the average number of categories for 
each group.) 

Fig. 3. Frequency comparison of strategies and 
mean number of categories. 


first columns (filled bars). (By different 
categories we mean instance categories; for 
example, one-instance category, two-in- 
stance category, and so on.) Notice how 
the Random condition, where every in- 
stance belongs to a separate category, aver- 
ages to the largest number of different 
categories. The second part (spaced bars) 
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compares the groups for the mean number 
of categories made by each. Here the 
difference between the Random group and 
(1,3) group is negligible. 

Table 2 is a frequency tabulation of all 
subjects showing their distribution by the 
kinds of strategy they employed and the 
experimental conditions under which they 
had worked. Note that there are just 
about five or six types of strategy that 
cover all subjects under the various condi- 
tions tested. Also note that the better or 
the more successful strategies are used 
under some conditions more than in some 
others. For example, none of the 18 sub- 
jects used the Length or the Alphabet 
strategies when the number of alternative 
words used was nine. Ten of these subjects 
used some form of the Formula strategy. 
When the sentences were formed with a 
vocabulary of only three words, on the 


TABI 
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other hand, 5 of the 18 subjects used the 
simpler strategies and only 4 discovered 
the Formula strategy; all of those 4 sub- 
jects came from the same condition group. 

lor each of the 10 experimental condi- 
tions, Table 3 presents means which have 
been calculated by assigning values to 
strategies by the statistical techniques dis- 
cussed above. Because the experimenter 
worked with no rules in the Random set no 
A-D ratios can be calculated for that 
strategy. Ratios for Condition IIT are found 
to be higher for each constant condition 
of vocabulary size, except in the grammar 
Group (11,3), which may be treated as an 
exception in all the tables given subse- 
quently because of an artifact in the con- 
struction of the language. The group means 
for the R scores are tabulated in the second 
row. Here the Random group and (I,3) 
group are more alike, as we have com- 


.E 2 


FREQUENCY DISTRIBUTION OF SUBJECTS IN THE RESPECTIVE LANGUAGE GROUPS BY THE 


Type OF STRATEGY THEY USED 











Kind of Strategy 
Grammar ee Se ee Be ee ee Ee a. See te N 
Length Alphabet Initial Init.-Final | Formula Misc. 

Random; 3 1 1 1 6 
(1,3) 1 2 3 6 
(11,3) 1 1 4s 
(IIT,3) 1 1 1 3 

Total | (3) (2) (1) (5) 4)" | (3) 18 
(1,6) } 2 1 1 I ! 
(11,6) ae 2 2 | 
(ITI,6) 2 2 1 1 

Total (4) (5) (3) (4) (2) 18 
(1,9) 2 4 
(11,9) 3 1 2 
(III,9) 1 1 4 

Total ~- (4) (4) (10) 18 
Sum Total 10 2 11 13 18 6 60 





Note.—Figures in parentheses are sums of frequencies immediately preceding those totals. 


« These four subjects all belong to the (II,3) group. 
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TABLE 3 


Group MEANS FOR 


Mean Score 


0.3 13 13 I 
\-D ratios .46 173 616 
R score 1.83 ? 83 7.0 2 9 
Time score 6.04 6.52 1.04 6.36 


Note.—The means have been calculated by three ir 
Time scores represent average time taken per sylla 





mented before, and both are inferior to 
Groups (1,9), (11,9), and (III,9). The 
time comparisons in the third row are some 
what fuzzy. Sometimes a subject, who may 
have worked with the instances for a ver) 
settle very poor 

; , 


SK ymetimes, the Oj )} osite 


long time, will on a 
strategy at the end; 
is true. 

A comparison of the group mx 
vocabulary and by grammar by applying all 
three made in Table 4. 
Although the figures given in this table are 
not always directly comparable, their rela 


ans Dy 


measures iS 


tive increase or decrease with change in 


i 
the experimental condition is constant 


Discussion 


This first study was intended to explor 
the kinds of responses subjects make to ; 
] 


categorizing task. The task as we have d 


) 


sorting strings of words 
presented to a subject o1 


fined it is one 
simultaneousl) 

small cards. The subject is free to work on 
these instances for as long as he wishes, 





without 


receiving 


further help 


from the 


experimenter, however, or from the 
structions that are given to him. He 
neither told how many categories lh 


should 


make, nor 


tion of success or 


of success, if any, 


d he does 


self an 


failure. 


is he given any indica 


The cri 


terion 


L: 


is set by the subject him 


not have to form groups 


if he thinks no groups can be made 


Though the experimenter is somewhat 


in the 


dark as to 


what 


1 


the nature of the 


TEN EXPERIMENTAL CONDITIONS 


Language Group 


[,6 11,6 I11,6 1,9 11,9 I1I,9 

437 601 73 725 .805 85 
‘sa | 4.66 3.33 6.83 ao ph 
3.61 4.75 4.64 4.09 | o&d 6.05 


tatistical measures: A-D ratios, Rating scores, and Time scores. 


controlling variables is, he does, however, 
anticipate a few problems. Therefore, he 
sets up his first study under a variety of 
experimental conditions to get leads about 
the possible functional relations between 
the experimental variables and 
subjects’ The basic variables 
that the experimenter works with are: the 
formulas which provide the 


some of 


responses. 


number of 


TABLE 4 


A COMPARISON OF RELATIVE DIFFICULTY OF 
(GRAMMAR AND OF VOCABULARY AS INDICATED 
BY THREE INDEPENDENT MEASURES 


Grammar: Random I II Ill 
A-D ratios BS Bh 73 

R score 1.83 4.44 | 4“ 5.3 
Time/Word 4.85 | 5.91 | 5.59 


Vocabulary: Random 3 6 9 
A-D ratios .62 ee 
pone ‘Word ca 5.48 - eye 
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basic framework for the strings and the 
number of different alternative words with 
which he develops his formulas. 

The results illustrate several important 
features about subjects’ responses in an in- 
formation processing situation. First, sub- 
jects process information by rules, or as we 
have suggested earlier, by pattern concep- 
tion. Next, we note that the rules formu- 
lated by more than 90% of subjects can be 
classified under five headings: Length, 
Alphabet, Initial, Initial-Final, and For- 
mula strategies. Third, there is a hierarchy 
within these five or six types of strategies 
which has been identified in subjects’ re- 
ponses. These strategies vary from simple 
to complex. Finally, the results indicate 
that some experimental conditions are bet- 
ter than others for inducing the formula- 
tion of the more successful strategies. The 
reader will recall that the more successful 
strategies have been defined as those that 
have a higher agreement with the experi- 
menter’s formulas. The Random strings, 
which were haphazardly constructed, em- 
body a condition which is least favorable 
to the emergence or acceptance of pattern 
strategies in subjects’ behavior. Evidence 
from our data suggested that vocabulary 
and grammar may be two of the more 
effective parameters in pattern conception. 


Pattern Conception. What we had ob- 
served as a dramatic instance of orderly 
responses demonstrated in the introductory 
study, was confirmed by the responses of 
more than 50 subjects in this exploratory 
experiment. It is not a fortuitous matter 
that no subject, not even one from the 
Random group, handed the cards back to 
the experimenter in some haphazard ar- 
rangement or declared the task insoluble. 
On the contrary, in each case the subject 
used some kind of a system by which h- 
had looked for relations among the in- 
stances to create his categories. Even with 
the Random group some subjects worked 
very hard and long trying to find some sort 
of scheme to organize their information. 
By and large, the subject could verbalize 
the type of relations that he had discovered 


in terms of a rule or rules though he was 
not always able to do this with precision. 


Hierarchy in Rules. The thing that 
amazed us most was that nearly all subjects 
could be classified so easily within the same 
general types of five or six strategies— 
strategies which were, by and large, the 
same ones we noted earlier in the responses 
of our single subject described above in the 
Illustrative Study. In other words, a 
hierarchy in rules that was indicated in the 
individual’s responses seems to characterize 
the group responses as well. This is a 
two-sided observation: it points first, to 
the fact that responses under all conditions 
of card sorting described above are classi 
fable within five or six easily detectable 
strategies ; and second, that these strategies 
themselves can be ordered in complexity. 

We can classify these strategies into two 
types: the ready-made type and the in- 
vented type. Ready-made rules are inde- 
pendent of the nature of the information 
to be processed. The rule of counting, for 
instance, to make groups of equal length, 
already exists in a subject’s repertory and 
can be easily transferred to almost any kind 
of material. The ready-made rule applies 
whether the materials be heads of cattle, 
pictures in an art gallery, or the coded 
messages about the enemy’s moves. This 
rule of counting is an instance of a simple 
quantitative strategy that is highly useful 
in dealing with certain kinds of informa- 
tion, but not so appropriate for certain 
other kinds. The counting rule may take 
on slightly different shades, as counting the 
number of different words, or counting the 
number of repetitions of any word, etc., but 
essentially it is the same rule that is applied. 

Alphabetizing is here classified as a 
ready-made strategy because this rule, too, 
already exists in the everyday repertoire of 
a college student. Since it is always possible 
to alphabetize a given series of words of 
any language, it requires little ingenuity on 
the part of a subject to apply this rule. 

It is important to note that a subject 
loses information by applying the simple 
strategies of counting and alphabetizing to 
these not-so-simple sequences. We have. 
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therefore, evaluated such strategies as being 


less successful. A strategy is rated as less 
successful in proportion to the amount it 
deviates from the finite state grammar 
which governs the given strings. In the 
simpler strategies a subject seems to work 
more with the perceptual characteristics of 
the information than with the structural 
ones. 

Subjects in the Random group, which is 
the control group for all the nine experi 
mental groups, seem to resort to the simple 
strategies, particularly the one of counting, 
more than the subjects in the grammar 
groups seem to do, except the subjects of 
the (1,3) group. The responses of this 
group have close kinship with the responses 
of the Random group. When a subject in 
any group either finds the task too difficult 
for him or when he does not wish to exert 
himself, functionally, he is treating the 
grammar strings as if they were random 
strings. Hence the strategies that subjects 
would be found to be using in these in 
stances would be close kins to counting or to 
alphabetizing strategies. It seems highly 
probable that subjects in the (1,3) group 
were treating the grammar strings as ran 
dom strings. 

The observed fact that a subject in the 
Random group follows a counting strateg) 
does not necessarily imply that the subject 
has worked with one strategy only. Sub 
jects’ protocols bear evidence to the con 
trary. Several subjects described complex 
strategies that they had tried to apply to the 
strings, but which they had discarded be 
cause they always seemed to find too large 
a number of single cases. Notice that a 
subject is uneasy about having too man) 
one-case categories, and this is virtually 
what he would find in the random set 
where each instance is unique. After having 
searched for abstract relations, a subject is 
forced by the stimulus properties of the 
set to settle for something less. This sample 
of behavior is apropos to the basic argu- 
ment in this thesis. 

Strategies labeled Invented are different 
from the ones just described. We classify 
the Initial,-Final, strategy and the For- 
mula strategy under this label because it is 


the nature of the materials which suggests 
these strategies to the observer. A high 
degree of discovery, however, is also in- 
volved in these strategies, but since a sub- 
ject is unaware of the embeddedness of any 
formulas within the materials, we prefer 
to call them the Invented rules rather than 
the Discovered rules. This label, however, 
applies most appropriately to the Formula 
strategy. In this strategy the subject has 
to go beyond the obvious information. The 
subject has to tease out an invariant 
sequence of words in each of the instances, 
as also he must find the recursive units. 
The search for invariance and variance 
yields formulas which are the basic frame- 
work for the grammar strings. This type of 
response calls for a certain degree of ab- 
straction in order to discover the true pat- 
tern implicit in the strings. 

Invented strategies correlate more closeiy 
with the information given. Since the in- 
structions a subject receives are neutral, his 
only source of information is the set of 
strings he has before him. The amount of 
information he is able to extract from them 
is a critical factor in the way he will process 
the information. His efficiency in ex- 
tracting the maximum_amount of useful 
depends largely upon two, 
sometimes interdependent, conditions of 
motivation and stimulus complexity. 


information 


Motivation. Each subject seems to re- 
define the problem to himself and it is very 
important how he does define the task. 
Subjects who look for quick and easy solu- 
tion—whose motivation appears to be to 
escape the situation as quickly as they can 

do not get themselves involved in the real 
problem of finding relations among sets of 
instances. They tend to manipulate the 
more obvious characteristics of the stimuli 
and fail to extract useful information con- 
tained in the sequence order of the words. 
Though they might also see other little bits 
of isolated information, they tend to ignore 
these and proceed to apply the easy rules 
of counting and alphabetizing, thus mini- 
mizing the cognitive strain. 


Stimulus Complexity. Subjects who do 
not get seriously involved in the task may 
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or may not end with the Invented strate- 
gies. One subject in the Random group 
worked with her 16 cards for 41 minutes. 
She tried to make groups by initial similari- 
ties, by ratio relations, by sequence rela- 
tions, and by a method of substitution. But 
when none of these strategies worked by 
her criterion of success, she resorted, ac- 
cording to her own version, to “the simplest 
strategy of counting.” The task of seeing 
relations among the random strings is per- 
haps psychologically more complex than the 
task of seeing relations where relations do 
in fact exist. 

The results from all nine grammar 
group conditions of finite state systems 
where relations do in fact exist, do not 
distribute under the Invented strategies 
only. Some do and some do not. In Table 
2 we notice that the stimulus condition, 
where nine alternative words were used to 
develop the strings of sequences, as com- 
pared to three in the Random group, and 
three and six in the other two conditions, 
more than 50% of the subjects evolved the 
Formula strategy as compared to the 0% 
of the Random group and 22% in each of 
the other two. Of these, the 20% of sub- 
jects in the vocabulary three group, all 
come from a single condition, (II,3), 
meaning that none of the subjects from 
Condition (1,3) or (III,3) or (0,3) ar- 
rived at the Formula strategy. 

Are these differences, indicated in the 
percentages of subjects who evolve the 
Invented strategies and those who do not, 
functionally related to the complexity of 
the experimental variables under which 
subjects work? If so, what constitutes ex- 
perimental complexity? That is the ques- 
tion we turn to in the next section. 


EXPERIMENTAL VARIABLES 


In the exploratory study of the previous 
section evidence for a search for pattern 
was found. Sometimes this search was 
creative, sometimes transitive, but what was 
not clear from the data was why this differ- 
ence existed in the quality of performance. 
Informal evidence suggested that instruc- 
tion was one variable that needed to be con- 


trolled more precisely and the data indi- 
cated that some stimulus features of the 
input were contributing to the outcome. 
Accordingly, some experiments were carried 
out to assess the effect of the following 
four variables: 

1. Instructions. Four different instruc- 
tions were tested to select one that would 
maximally involve the subject in the task, 
but with minimal guidance from the nature 
of the instructions. 

2. Number of instances. The number of 
instances per formula was varied from two 
instances to three to four, to test the effect 
of the amount of evidence on the quality of 
strategy. 

3. Number of different alternatives 
(Vocabulary). Three or nine different 
words comprised the vocabulary for de- 
veloping the different language systems. 

4. Structural similarity (Grammar). The 
effect of the formal structure of the lan- 
guages, three rules as against five rules 
with variation in similarity between the 
rules, was also investigated. 


The Instructions 
(Experiment ITT) 


Materials 


Two new artificial languages, (1,9,16)» and 
(11I,3,23). were constructed to study the effects 
of four types of instruction on each. Language 
(1,9,16)» is constructed with three formulas in 
its grammar; its vocabulary consists of nine words. 
Sixteen instances were selected to make up this 
set. Language (III,3,23). contains five formulas; 
three words have been used for its vocabulary. 
This set contains 23 instances of well-formed 
strings. Two additional language conditions, 
(1,3,16) and (I,3,23), were also tested, both 
under “eliminate speed” instructions. Altogether, 
then, there were 10 language conditions that were 
compared in this study. 


Procedure 


The same general procedure described earlier 
was used in all experiments reported in this sec- 
tion. The specific variable to be considered here 
is the kind of instruction to be used. Ten groups 
of subjects, four subjects per condition, were run 
under four different instructions. No subject 
served twice in any part of the experiment. Sub- 
jects were distributed, one per condition, in the 
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order in which they came. The following four 


sets of instructions were used : 

1. Neutral. 
sorted into groups, into as many or as few groups 
as you wish to make, and for whatever reason 
you wish to make them. When you are satisfied 
with your groups inform the experimenter.” 


“Here is an array of cards to be 


2. Eliminate Speed. “Here is an array of cards 
to be sorted into groups, into as many or as few 
groups as you wish to make, and for whatever 
reason you wish to make them. Your performance 
will be judged not by speed, that is, not how 
quickly you are through, but by what you do 
When you are satisfied with your groups inform 
the experimenter.” 

3. Relations. “Here is an array of cards to be 
sorted into groups, into as many or as few groups 
as you wish to make on the basis of some relations 
that you can find in the members of the groups 
Your performance will be judged not by speed, 
that is, not by how quickly you are through, but 
by how well you discover relationships. When you 
are satisfied with your groups inform the experi- 
menter.” 

4. Logical. “Here is an array of cards to be 
sorted into groups, into as many or as few groups 
as you wish to make on some logical basis. Your 
performance will be judged not by speed, that is 
not by how quickly you are through, but by how 
logical you are. When you are satisfied with your 
groups inform the experimenter.” 


Results 


Table 5 presents the means, ranges, and 
critical values for the A-D ratios, R scores, 


and Time scores for 40 different subjects 
run under 10 different experimental condi- 
tions. The critical differences between 
groups were calculated by the Link-Wal- 
lace method of short-cut allowance for one- 
way analysis of variance (Mosteller & 
Bush, 1954, p. 305). This statistic uses the 
range as a measure of variation instead of 
the sum of squares. 

Conditions F, G, H, and I in Table 5 
apparently belong together. The difference 
between any two totals in this group is 
below 1.51 in the A-D ratio comparisons, 
below 18.80 in the R score comparisons, 
and below 41.59 in the time comparisons. 
It is clear, then, that variation in instruc- 
tions fails to produce any recognizable 
difference in the performance of subjects 
in language (III,3,23),. Presumably, the 
task involved in this language is too hard 
for the subjects. Even when the subjects 
(Column H) took longer than subjects in 
any other group, they failed to give efficient 
results. Since the complexity of this lan- 
guage masks the effect of instruction (if 
any), we shall concentrate on the simpler 
Grammar I, 

Conditions C, D, E, and J likewise 
obviously belong together. None of the 
differences between any two of these con- 


TABLE 5 


MEANS, RANGES, AND CRITICAL VALUES 


A-D ratios 
Language Instrictions |_—___ 
M R Cl 
A (I, 3, 16) Elim. Speed | _ .333 40 
B (1,9, 16), Neutral | .402 .433 | 1.95 
: Elim. Speed .69 25 
D Relations 95 .20 
E Logical 76 14 
F (III, 3, 23), Neutral | 48 21 1.51 
G Elim. Speed 50 43 
H Relations | .58 27 
I Logical |} 49 43 
pe, 3325) lim. Speed | .79 46 





R score Time score 




















| 
Mj} R | cv M R CV 
1.75] 3 | 2.38} 1.39 
4.25] 7 0.47 | 3.55 | 3.56 | 41.58 
7.5 6 3.37 | 2.25 
9.75| 1 5.49 | 4.35 
7.5 6 5.35 | 2.71 
4.0 2 | 18.80] 7.81 | 16.94 | 41.59 
5.25 5 | 9.04 | 5.99 
5.0 | 4 | 18.58 | 22.59 
5.75 | 5 | 8.55 | 17.70 
8.75 1 9.76 | 10.09 
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ditions exceeds the calculated critical values 
in any of the three measures. All of these 
four conditions belong under the simpler 
Grammar I, which also includes Condi- 
tions A and B. Subjects who served under 
Condition A performed most poorly, even 
though, as in C, D, E, and J, they were 
instructed not to speed. Subjects for 
(1,9,16), who served under Neutral in- 
structions (see Column B) do not resemble 
or differ consistently with the other groups. 


Discussion 


In the preceding section we raised a 
question about why some subjects follow 
easy strategies and some others follow more 
complex ones. An analysis of the subjects’ 
protocols gave an indication that one neces- 
sary, though not sufficient, condition for 
evolving a good strategy is that the subject 
should extract all the useful information 
from the instances he receives. This busi- 
ness of extracting the most useful informa- 
tion is in itself a very critical step in pat- 
tern conception. The success with which 
a subject decodes relevant information 
from the stimuli depends partially upon 
the degree to which he involves himself in 
the task. This in turn is influenced by a 
variety of factors, such as the past history 
of the subject, his present motivation, his 
age, etc., as well as the instruction he re- 
ceives from the experimenter regarding the 
nature of the problem, other things being 
equal. An individual’s history or his moti- 
vation is difficult to control by laboratory 
techniques, but the experimenter can ex- 
plore the instruction variable. 

Actually, a subject receives two kinds of 
information from the experimenter. One 
kind comprises the main body of informa- 
tion that is contained in the strings of non- 
sense words with which the subject is sup- 
posed to work; the other is some informa- 
tion about this main body of information 
to be categorized. The main body of infor- 
mation is contained in what Bartlett (1958) 
has called a “closed system,” which the 
subject must discover. The efficiency with 
which a subject processes the main body 
of information will in part depend on how 


.s) 


well he has decoded the information from 
the instances presented to him. It is the 
experimenter’s wish to get a subject started 
on this search but without telling him how 
to proceed. Thus in setting up the instruc- 
tions the experimenter keeps two things in 
mind: to get a subject maximally involved 
in solving the problem and to do this with 
a minimum of guidance from outside. 

Neutral instructions are suspect when 
they are evaluated by this two-edged 
criterion. In the absence of specificity, both 
of instruction and of materials (nonsense 
strings), a subject may often resort to self- 
instruction, either to work for speed, or to 
find some easy-way-out strategy; he there- 
by loses much information. To overcome 
this difficulty it seems necessary to furnish 
a subject with some other kind of instruc- 
tion. 

French (1954a) has shown that perform- 
ance improves with more specific instruc- 
tions in a task that involves sequential de- 
pendencies. The question here, however, 
is not one of improving a subject’s per- 
formance by giving him more guidance or 
by making the instructions more specific— 
we take that for granted—but rather the 
question is one of finding the kind of in- 
structions that will induce a subject to 
make the maximum effort to solve the prob- 
lem, and to bring this about with as un- 
structured a kind of working instruction as 
possible. 

We noted in the results that when a 
language is too difficult for the subjects, as 
(1IT,3,23),.. seemed to be, the effect of in- 
structions is confounded. But with the 
simpler Grammar I, subjects do better 
when they are instructed to slow down, 
with the one notable exception of (1,3,16) 
—see Condition A. We pointed out earlier 
that the results of Condition A are the 
poorest of all the conditions tested. A 
differs from B, C, D, and FE in the size of 
the vocabulary; it differs from F, G, H, 
and I in its grammatical structure; and its 
difference from Condition J seems to be 
clearly related to the number of instances. 
Presumably, A suffers in part from one or 
more of these conditions. 
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The difference between B and Conditions 
C, D, E, and J was attributed to different 
instructions. All four conditions, C, D, E, 
and J, required subjects not to rush (note 
that Relations and Logical also eliminate 
speed), and they all emphasized that it is 
what a subject does that is more impor- 
tant than speed. Condition B is neutral to 
these emphases. It does not seem unreason- 
able, therefore, to suppose that the em 
phasis on “what you do” increases the search 
for useful information, and this search for 
the useful information, we have granted, is 
one critical step in pattern conception. 

Johnson (1955) discusses the effect of 
instruction on performance under the con 
cept of “set” (see pp. 64-67). After de 
fining set as “temporary integration of be 
havior,” he elaborates on its meaning as in- 
tegration in terms of the obvious and simple 
things, or integration in terms of “subtle 
aspects of complex stimulus patterns.” Ex- 
plained in this way, set seems to be no 
different from recoding of information. 
Verbal instructions can drastically change 
the quality of recoding. As for instance, in 
the Carmichael, Hogan, and Walter (1932) 
experiment the same figures are recoded as 
dumbbells by one group and as spectacles 
by another group of subjects, depending 
upon the instructions the group received. 
In studies such as this one, the experi 
menter has already laid down the recoding 
scheme for his subjects through the in- 
structions he provides, and then he is sur 
prised (and pleased perhaps!) to find that 
subjects have followed his scheme. 

It is this kind of coaching in the re 
coding scheme of a subject that we wished 
to avoid in our instruction. For, after all, 
a major interest in the present research is 
to study those factors that affect the re- 
Very little 
new information will be gained by recover 


coding scheme of a subject. 


ing the same coins that we originally fed 
into our slot machine. 

So, what do we finally conclude? On 
the whole one could say that the instruc 
tions, to the extent to which they were 
varied, did not produce a strong effect on 
the quality of the performance. Other 


variables (complexity of grammar, num- 
ber of different alternatives, number of in- 
stances) seem to be more important be- 
cause they can make the problem so hard 
that they mask out the effects of instruc- 
tions. When conditions are optimal for 
instructions to have an effect, the main 
effect that changed instructions produced 
was to slow subjects down to make them 
stop, look, and think more carefully, and 
this is the only effect the experimenter 
wished to induce through instructions. Any 
more help than just maximally motivating 
subjects to think would be a violation of 
what has been described as good instruc- 
tion. Since the instruction to eliminate 
speed, together with the emphasis on “what 
you do,” is common to all three sets of in- 
structions tested, it seems reasonable to 
suppose that this amount of change in the 
neutral instructions was enough to involve 
the subject in the task. Therefore, we 
adopted Eliminate Speed instructions as 
standard for all subsequent studies. 


The Number of Instances 
(Experiment IV) 


Vaterials and Procedure 


One important characteristic of an ensemble of 
instances is its size. The size of the set can be 
controlled by controlling the number of formulas 
of a grammar and/or the number of instances 
given for each formula. Moreover, one might 
like to control the total number of words in the 
ensemble. The structure of the languages used 
in this study is clarified in Table 6. 

The Random condition contained no formal 
grammar; it served as the control group for all 
the other conditions. Languages B, C, and D are 
identical except for an orderly increase from two 
instances per loop in (1,3,13) to three instances 
in (1,3,18) to four in (1,3,23). The number 
of instances per loop and the size of the set are 
interrelated; as instances increase in number, the 
size of the set increases also. 

Languages (1,3,13) and (1,9,13) are also identi- 
cal except for the difference in the size of 
the vocabulary: B is developed with three words; 
E with nine. Thirteen instances for Grammar I 
have been considered to comprise a minimum set 
by definition. Condition F, (1,9,16)», represents 
a random increase in the number of instances 
from 13 to 16. The choice of the size, 16, was 
made to match the size of the Random set. Con 
ditions E and F will be compared for a small 
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TABLE 6 


STRUCTURAL ANALYsIS OF LANGUAGES 




















Grammar or Vocabulary or Total 
Language number of number of Instances Size number of 

formulas different words per loop of set words 

A Random 0 3 1 16 118 

(0,3,13) 

B 5,35) 3 3 2 13° 90 
C (13,18) 3" 3 es as oS eee 
D (1,3,23) 3. 3 4 | 23 «| (180 
E (19,13) 3s 9 2 oe er 
F (1,9,16) 3a 9 Irregular wri a 
G (III,3,23)c 5 3 2 es aoe 
H (III,9,23)<. 5b 9 2 23 ~—«| «(180 


* Same as in B. 
b Same as in G. 


increase in the number of instances from 13 to 
16, other things being held constant. 

A similar comparison in size is envisaged for 
Languages (111,323). and (III,9,23).. Gram- 
mar III consists of five formulas as against three 
of Grammar I. A minimum set for Grammar ITI 
consists of 23 instances, which also allows 2 
instances per loop. All experimental conditions for 
these two sets were held constant, except for the 
size of their vocabulary. 

One could also compare (1,3,13) and (III, 
3,23). with each other as minimum sets for 
their respective grammars. Since the number of 
instances per loop has been held constant, any 
differences found between the two languages would 
be attributable either to the number of formulas 
in the grammar or to the number of instances in 
the sets, or to both. 

Ten subjects per condition, using the instruc- 
tions aimed to eliminate speed, were run for each 
of the eight language conditions. 


Results 


Table 7 presents the means and critical 
values for 10 language conditions by the 
usual three measures. Only 8 of these 10 
groups, A through H, concern us in this 
section. We note first of all that in Gram- 
mar I, Conditions B, C, and D, as the in- 
stances per loop increase from two to three 
to four, strategies improve correspondingly. 
In. working with 13 instances a_ subject 
spends about the same amount of average 


time per word as a subject working with 
the larger sets of 18 or 23 instances, but 
he does not achieve better results. In fact, 
the strategies used under this condition are 
poorer than most and about as poor as the 
ones used by the Random group, which 
used the poorest strategies. 


TABLE 7 


MEANS, RANGES, AND CRITICAL VALUES 


F eahed 


| A-D | Time 

Language | ratio | R score | score 
Group eee ee | —_— 

M | M || a ae 
= —| = 7 = 
A Random | 

(0, 3, 16) ~ | 2.8 | 6 /11.38 /49.24 
B (I, 3, 13) | .253 | 3.1] 7 | 8.14 [29.93 
C (I, 3, 18) 389 | 4.6 | 7 | 6.71 |23.02 
D (I, 3, 23) .598 | 6.5 | 8 | 7.73 |14.27 
E (I, 9, 13) .532 | 6.7 | 6 | 4.996] 8.29 
F (1, 9, 16), -748 | 8.2 | 7 | 3.916) 4.23 
G (IH, 3,23). .419 | 4.7 | 8 | 5.396) 7.31 
H (III, 9, 23), .706 | 6.0 | 7 | 5.035) 6.53 
I (III, 3, 23)a 541 | 3.5 | 8 | 4.998)11.05 
| (III, 9, 23)a 81 | 7.1 | 7 | 5.47814.39 

Critical Value 33:37 79.08 
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Next we note that without any increase 
whatsoever in the number of instances, if 
we merely changed the size of the vocabu- 
lary from three words to nine, performance 
is drastically modified (cf. Column B with 
E and G with H). Now look at the time 
measures. Subjects in Column I take far 
less time than subjects in Column B and 
yet they do far better than subjects in 
Column B. The same is again true, though 
not quite to the same degree, of subjects in 
(111,3,23), and (III,9,23),. Apparently, 
one must look for some other factor(s), in- 
stead of the number of instances, to ac- 
count for these differences between the 
groups. 

A further point of interest may be noted 
from a comparison of Languages (1,9,13) 
and (1,9,16),. Earlier we explained that 
the two languages are identical in ever) 
respect except for a fortuitous increase in 
the number of instances from 13 in Condi 
tion E to 16 instances in IF; the results, 
however, that follow this change are not so 
fortuitous. Even a small random increase 
in the number of instances in a language 
with the larger vocabulary apparently makes 
some difference in the subjective redundancy 
of the set for the subjects. But when we 
compare Columns A and F, both sets with 
16 instances in each, the Random set scores 
are much inferior. 

Now compare the minimum sets for both 
grammars. We consider 13 instances fo! 
Grammar I and 23 instances for Grammar 


III, as minimum sets for each, by definition 
(1.3.13) is not too different from (ITI,3, 
23), nor is (1,9,13) very different from 
(1I1,9,23).. Presumably, the effect of the 
size of the set is either negligible or it 
masked by the effects of other variables. 

To summarize our observations, three 
points seem rather clear: the larger vocabu 
lary decreases difficulty, more instances per 
formula also decrease difficulty, the effect 
of the total number of instances in a set is 
easily modulated by other variables. 

It is not clear, however, as to what effect 
an increase in the number of formulas pro 
duces upon subjective difficulty. 


Discussion 

Two of the four points summarized 
above pertain to the number of instances in 
the set; they are not unrelated points. The 
total number of instances in a set is auto- 
matically increased with each new addi- 
tion of an instance for any of its formulas. 
This means that two things happen for 
a subject. First, every single addition of an 
instance to the set increases the difficulty 
of the set by increasing the number of rela- 
tions the subject must now study (if n 
cards have m relations, then n+1 cards will 
have m+n relations). Second, every addi- 
tion of an instance for any of the formulas 
contained in the set simultaneously re- 
duces difficulty by increasing the amount 
of evidence available. This latter gener- 
alization, however, is not true for a random 
set of strings; an increase in the number 
of instances in the Random set does not 
reduce difficulty, but rather increases con- 
fusion. 

In the context of the present discussion 
we define redundant strings as those that 
have been derived from the same gram- 
matical formulas. They repeat information 
by replicating formulas or instances or 
both. Random strings, on the other hand, 
contain no formulas or any orderly repeti- 
tion of information. They could be viewed 
as containing as many formulas as there 
are instances in the set, as was suggested 
previously; as such, they will contain only 
one instance per formula. A redundant set 
which has been generated with one, two, or 
more formulas will contain instances which 
repeat information about the formulas. 
Thus the number of instances per formula 
and the number of formulas that constitute 
the grammar are somewhat interdependent 
features in a language. 

Random strings are confusing to a sub- 
ject because they are difficult to package. 
They contain more information than any 
redundant language of comparable size 
contains (see Columns A and F) and this 
mass of disjointed information is difficult 
for a subject to handle. The larger the size 
of the Random set the more difficult it is 
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for a subject to process the information it 
contains. Thus, the critical factor for in- 
creasing the efficiency of performance is 
not one of increasing the amount of hap- 
hazard information, but it is one of re- 
ceiving a greater amount of more useful 
information. Reiss (1958) came to the 
same conclusion with his subjects when he 


varied the amount of initial information 
presented. 
In the absence of repetition (redun- 


dancy) of information, relations between 
instances are few and far between. The 
subject, who is usually disposed to sorting 
his environment by searching for redun- 
dancy, is nonplused in a random set-up. 
Now introduce some redundancy into the 
situation by presenting to the subject two 
instances per formula, instead of one in- 
stance or none, and performance improves. 
Bartlett (1958) made this same observa- 
tion on his subjects in tasks that required 
gap-filling by methods of interpolation and 
extrapolation. In the case of a simple 
formula, a string with one loop only, if one 
now gives the subject just one more in- 
stance, making it three instances per loop. 
the improvement is phenomenal. The sub- 
ject’s response almost begins to approxi- 
mate to the grammatical structure of the 
language. One is led to think that de- 
pending upon the amount of complexity 
of the formulas, one could expect an upper 
ceiling in improvement in performance be- 
yond which an increase in the number of 
instances ceases to be effective. 

When the number of instances per loop 
is small, say zero or one, it is more possible 
to confuse the instances of one formula 
with the instances of another on the basis 
of superficial relations—what Wertheimer 
(1945) calls “and-summative aggregations” 
(p. 190). Actually in the ideal situation, 
where instances have been grouped accord- 
ing to the formulas they exemplify, in- 
stances which are positive for one formula 
are negative for all other formulas in the 
set. This, however, is true only when a 
subject has clearly seen the structure of 
the formulas in abstraction. Pattern con- 
ception requires the observer to make a 


two-fold response to the stimulus ensemble ; 
a response of the ingroup type and of the 
outgroup type. The positive instances for 
a formula the subject must “aggregate”’ 
as members of the ingroup for that subset ; 
all other instances become negative in 
stances for this formula and which he 
must “segregate” as the outgroup members 
belonging somewhere else. In effect, the 
subject is making an inductive-deductive 
response. Hypothesis making and hypoth- 
esis testing are going on simultaneously 
as a subject decides which instances art 
positive and which are negative for the 
particular category he is working with. His 
certainty increases as the number of in 
stances that fit his hypothesis increases. 
More instances per formula increase evi- 
dence and consequently decrease the proba- 
bility of error. It is a well-known fact in 
communication engineering that one re- 
peats a message as a safety device against 
“noise” in the channel. The noise which 
the negative instances create for a subject 
in our channel can be reduced by increasing 
the number of positive instances, other 
things being equal. 

Thus we are led to conclude that within 
certain limits, the number of instances per 
formula reduces difficulty, subject to cer- 
tain interaction effects from other variables. 
Although an increase in the size of the en- 
semble, within a reasonable range, increases 
the difficulty of the set, the quantitative 
aspect of the set is relatively unimportant. 
If, however, the size of the set is too large 
the problem is either rendered insoluble, 
or it becomes ridiculous, depending upon 
the number of basic strings in the set. 

The interaction effects we referred to 
above have pointed in the direction of the 
size of the vocabulary and of the com- 
plexity of grammar as two possible vari- 
ables that affect pattern conception. In 
fact, one of the points we noted from our 
results was that a large number of differ 
ent words reduced the difficulty of the task 
for the subject. We could not see our way 
clearly on the question of grammatical 
complexity. Let us explore the effects of 
these two variables further, 
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The Number of Different Alternatives and 
Structural Similarity 


(Experiment V ) 


Two variables that have been clouding 
the analysis of our data are the parameters 
of the number of different words and of 
structural similarity of the strings. We 
have equated the number of different alter- 
natives with the vocabulary of a language, 
and its grammar, roughly, with structural 
similarity. Two sets of alternatives, 
sisting of three words and of nine words, 
constitute the size of the vocabulary of the 
languages to be tested. The generators for 
these languages consist of three basic 
strings, five basic strings with replication 
and five basic strings without replication. 


con- 


The Random set of strings has no grammar, 
of course. 


Materials 


Two additional languages, (III,d), with vocabu- 
laries of three and nine words, respectively, were 
constructed to control similarity in the formal 
structure of the grammars. By similarity in struc- 
ture (in this context) is meant: (a) One gram- 
mar contains some information about the other 
grammars. For instance, Grammar (III,d) con- 
tains five formulas, three of which are identical 
with the three formulas of Grammar I, and with 
three of the five formulas of (III,c). (6) Struc- 
tural similarity also means that one formula 
contains some information about the other formulas. 
For example, Formulas 4 and 5 in Grammar 
(III,c) repeat Formulas 2 and 3; in (III,d) 
they do not. This explanation of the structural 
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(1,9,13) E | 3.93 | 3.43 | 1.8 15 2.45 | .58 | 1.69 | 10 | 2. 88 
(1,9,16) F | 6.28 | 5.93 | 3.49 | 1.45 | 1.51 1.79 | 52| 3.35 | .86 
(I11,3,23). G | 2.02 | 1.55| .04 | 1.42 | 1.90 | 3.60 | | 1.51 | .66 | 2.06 
(III,9,23). H | 2.66 | 2.50/1.10| .36| .57 1.99 | | 1.09 | | 3.25 | 1.16 
(111,3,23)a 07 | .03 | 1.90 | 2.35 | 2.89 | 4.74 | 1.10 | 2.08 | 3.68 
(111,9,23)a J 3.61 | 3.30 15 42 31 94 | 1.90 81 | 2.85 


Note.—+t values for A-D ratios are entered in upper triangle. 
not directly comparable for different sized sets, A-D ¢ values 
p= .l .05 .02 

t = 1.734 2.101 2.552 


R score et values are entered in left triangle. Since re D ratios: were 
apply for sets of equal size. 


.01 .001 df = 18 
2.878 3.922 








wr 


a a ae a. 


| 


—_ aa — ae of 


om am 


imar, 


»cabu- 
were 
ormal 
struc- 
gram- 
other 
con- 
ntical 
with 
Struc- 
rmula 
nulas. 
mmar 
IId) 


ctural 


= (I11,9,23)a__ | 





were 














SOME 
similarity in the grammars can be schematized 
thus: 


Grammar 1 Grammar (III,c) Grammar (III,d) 


1, - (-)--- -(-)--- 1. - (-)--- 
et ee ee as Soe eee 
3.-(-)--()- 3. -@)--@)- 3.-O--()- 
4.--(--)-- 4. ---((-)(-))- 
5. -(-)--(-)- 5. -¢)-©)-©- 
Results 


In Table 8 are shown results of the ¢ 

tests computed to evaluate the significance 
of the differences between pairs of condi- 
tions. It can be seen from this table that 
performance is significantly better with a 
vocabulary of nine words than of three 
(cf. B with E, G with H, and I with J). 
But when the grammar changes from three 
basic strings to five, R scores remain pretty 
much the same. 
The A-D ratios rise automatically with an 
increase in the size of the set, as we noted 
earlier, so that the differences do not re- 
flect a difference in the subjects’ perform- 
ance. Time, except for (1,3) remains con- 
stant for the several conditions. 

In comparing the minimum sets of 
(1,3,13) and (1,9,13) with each other, we 
are impressed with the difference between 
the two conditions as evident from all three 
measures in Table 5. The reader will recall 
that these sets are identical in every respect 
except for vocabulary; the number of 
different words, then, does affect the nature 
of the strategy used. But when we turn to 
Language (1,9,13) and (1,3,23) which vary 
in two parameters—in vocabulary and in 
the number of instances per formula—we 
notice that the R score differences vanish, 
that the A-D ratios for (1I,3,23) should be 
(and are) higher by virtue of the size of 
the set, and that time differences now ap- 
pear. 

The Random group and the (I,3,13) 
group arrive at similar strategies but at 
divergent speeds (cf. Columns A and B in 
Table 8). 

Table 9 compares two types of strategies 
in terms of speed of sorting. Individual 
data have been clipped for the two ex- 
tremes of the strategies rated as low and 
high. Column A is a listing of the amount 
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of time spent per word by subjects who 
were rated low, that is, who scored either 
1 or 2 on their strategies, and Column B 
presents the data for all subjects who 
scored 9 or 10. We remind the reader that 


TABLE 9 


TIME SPENT PER SYLLABLE BY INDIVIDUAL SUBJECTS 
ON STRATEGIES RATED BY THE R Score SCALE AS 
1 AND 2, AND 9 AND 10 






































A 
| Strategies | snename 
Condition | rated low rated high 
| (1 and 2) | (9 and 10) 
Random | 1.61 
(0,3,16) | 2.49 
3.09 
1.89 
20.09 
Median 2.79 
(1,3,13) 1.89 
3.67 
4.90 
$37 
Median 4.285 
(1,9,13) 9.03 
10.18 
wate 
7.56 
Median | 8.245 
(I11,3,23), | 2.61 §.32 
III,9,23)¢ | 5.44 
6.96 
7.31 
Median | 6.96 
(111,3,23)a | 8.58 
| 1.24 
1.54 6.19 
| 3.22 
Median | 2.38 6.19 
(IIT,9,23)a 4.05 
5.41 
343 
| 4.76 
|} 14.96 
| 4.20 
Median 4.48 
Median—All 


groups 2ia0 6.19 
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strategies which have been rated low are 
based upon a counting device of some sort ; 
those rated high, either parallel or are 
equal to the experimenter’s grammar. 

Several things seem obvious from Table 
9. The poorer strategies go with the 
smaller vocabulary, not with the larger one 
(see Column A). In the strategies rated 
high (see Column B) there is either no 
subject, or only one subject, per condition 
under a vocabulary of three words. Also 
note that a subject spends more than twice 
as much average time per word in the better 
strategies as a subject using the less effi 
cient ones. 


Discussion 


The Number of Different Alternatives 
(Vocabulary). The results indicate that 
the size of the vocabulary does affect pat- 
tern conception. We notice that when nine 
different words were used the subject was 
able to decode and classify information 
more readily than when only three differ- 
ent words were used. Since pattern con- 
ception is a matter of seeing meaningful 
relations among the parts of a set of mes- 
sages, the function of the number of differ 
ent alternatives should be to convey infor 
mation to the decoder about these rela- 
tions by pointing up similarities and differ- 
ences among the messages. Information 
about similarity is usually conveyed 
through repetition, and about the difference 
by the use of a different symbol. When the 
number of alternatives is too small, the en 
coder is forced to use the same symbol re 
peatedly even when the messages are differ- 
ent. His problem is initially the same as 
that of an artist who has to paint a multi 
colored landscape but who only has one, 
or maybe two, colors to paint with. When 
the number of alternatives is small, for ex- 
ample, three as in some of our languages, 
there is interference in grouping the mes 
sages because of the exaggerated similari 
ties among the sequences. With a larger 
number of alternatives, for example, 
larger than nine, one can again expect an 
interference in grouping the messages, but 
this time the interference will be due to 


the exaggerated differences among the 
sequences. This again will tend to obscure 
the pattern. Perhaps between these two 
extremes of too few and too many alter- 
natives there is a number which serves the 
ideal function of emboldening the pattern 
for the decoder, other things being equal. 

Individual data for 3-group (any gram- 
mar group tested with three alternatives) 
show that when faced with haphazard simi- 
larities subjects resort to some kind of a 
counting device. Five subjects out of 30 
in the 3-group followed the Length 
strategy (counting) and 12 subjects in the 
same group used counting of some other 
form. Thus a total of 17 subjects out of 
30 followed some kind of a counting de- 
vice. None of the subjects in the 9-group 
used the Length strategy and only 5 sub- 
jects used some shade of the counting 
strategy. Seventeen subjects in the 3-group 
as opposed to 5 in the 9-group is not too 
small a difference to overlook. Subjects in 
the 9-group, we suggest, can decode and re- 
code the information better, hence their 
strategies are more closely related to the 
kind of information that is contained with- 
in the set. The strategies of these subjects, 
therefore, correspond more nearly to the 
grammar of the language. 

In Grammar III we notice that although 
subjects spend the same amount of average 
time under all four conditions they do not 
employ equally good strategies in all the 
four conditions. Strategies used by the 9- 
group are more successful than the ones 
used by the 3-group. Thus once again evi- 
dence points to the number of different 
alternatives as a critical variable. 

Now turn to Conditions B and D in 
Table 7 to make an intralanguage compari- 
son within Grammar I. The results suggest 
that there is an interaction effect between 
vocabulary and the number of instances 
for each recursive loop in a basic string. 
Although the number of alternatives re- 
mains three, performance improves with an 
increase in the number of instances (see 
D). With four instances for each loop, 
subjects perform almost as well as sub- 
jects of (1,9,13) ; they take more time with 
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the set, however. Whether this increase in 
time for the (I,3,23) group is due to the 
larger size of the set or because it takes 
longer for the 3-group to find the pattern 
than it takes the 9-group, it is hard to say. 
Since groups (I,3,13) and (1,3,23) take 
the same amount of time for different sized 
languages, it could be argued that set size 
is less critical than the vocabulary factor. 


Structural Similiarity (Grammar). The 
other notable parameter in pattern concep- 
tion seems to be the structural similarity 
of sequences. In their studies on visual 
pattern recognition, Fitts et al. (1956) 
raised the problem of “different types and 
degrees of redundancy and of examining 
the effects of complexity of variables in- 
fluencing pattern recognition” (p. 9). 
Although the measure of redundancy does 
not apply to our materials, as pointed out 
earlier, the notion of repetitiveness does. 
By substituting the phrase structural simi- 
larity for redundancy in the quotation from 
Fitts et al. we can state our own problem 
thus: What is the effect of the type and 
degree of structural similarity on pattern 
conception ? 

Similarities among sequences 
upon two kinds of repetitiveness: the repe- 
tition of words because of the limited 
vocabulary used and repetition due to the 
grammatical structure of the strings. In 
the previous section we discussed repetitive- 
ness based upon the number of different 
alternatives. Structural repetitiveness was 
defined in terms of the number of basic 
strings contained in each language and the 
number and placement of the loops within 
the basic formulas. A particular grammar 
selects a particular type of repetitiveness. 
The problem arising from such selection is: 
Is the pattern of similarity and difference 
more difficult in one set than in another? 

In scaling the languages for structural 
similarity, one could begin with the zero 
condition in grammatical rules, as in the 
random set of strings, where rules of for- 
mation are absent. Results show that in the 
absence of structural similarities among the 
strings a subject searches for peripheral 


depend 


similarities and he settles for the simpler 
strategies such as those of Length or of 
Initial similarity. Peripheral strategies, 
however, are not solely the function of 
nongrammatical structure. We _ noted 
earlier, that when the number of alterna- 
tives is small, as in (I,3,13), that even 
when the language has a grammar, the 
grammar of the set becomes so obscure 
through word repetition that a subject 
treats the set as a random one. In such an 
instance, the strategies used for a gram- 
matical set will approximate those of the 
random set. 

When we turn to the finite state sys- 
tems, which of course are grammatical in 
structure, structural similarities consist in 
the number and kind of formulas con- 
tained within the grammars as explained 
before. Some questions that now arise are: 
(a) When a system is enlarged by adding 
one or more parts, do we increase variance 
or its invariance or both? Thus, are three 
formulas easier to handle than five? (b) Is 
the quality rather than an increase in the 
number of parts a critical factor in strategy 
construction? Are five rules with replica- 
tion better than five rules without replica- 
tion? More specifically, is Grammar 
(III,c) more simple than (1,3) and 
(III,d) or is it more complex? 

The results do not give any clear indica- 
tion either as to the degree or the direc- 
tionality of the grammar effects. In the Ex- 
ploratory Study, the data indicate some im- 
provement in performance with the in- 
crease in the number of basic strings, from 
two strings to four, and to five (see Table 
3), but the time also increased correspond- 
ingly. The better controlled experiments of 
the later studies give no clear indication of 
significant change in performance with an 
increase in the number of basic strings, the 
number of states, and the number of rules 
of transition. The relative difference in the 
grammar effects based on these factors is 
hard to assess for two reasons: first, there 
are interaction effects of other variables 
with grammar ; second, there is no adequate 
measure of structural similarity. Since 
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grammatical similarity is more structural 
than peripheral, it can easily elude a sub- 
ject but unless a subject takes advantage 
of grammatical repetitiveness implicit in 
the language, he fails to see structure. How 
well he will work with the grammar of the 
language will depend, in part, upon the way 
the messages have been encoded and, in 
part, upon how often they have been re- 
peated. Without a sufficient number of 
different alternatives to indicate similarities 
and differences among the parts of the sys 
tem, and without an adequate number of 
repetitions of the messages, the grammar 
of the system will remain obscure to a sub- 
ject and its effect almost negligible upon his 
recoding behavior. On the other hand, 
when a subject does figure out the gram- 
matical principle in any one subset, it 
seems to matter less how many subsets there 
are in the entire system. It then becomes 
simply a matter of transfer of a principle. 
Essentially, we can argue that there is onl) 
one basic principle of variance and invari- 
ance involved in a finite state system. All 
the formulas—either one, or three, or five, 
or any number—are merely variations of 
this single principle of additions or dele 
tions of symbols to a basic, invariant, se 
quential pattern. Usually, a subject applies 
one method of classification fairly con 
sistently to all instances in the set. If this 
method of classification happens to approxi 
mate to the experimenter’s grammar, it 
seems to matter little whether three for 
mulas have been built into the structure of 
the language or five. And if his strategy is 
different from the experimenter’s grammar, 
then it matters even less how degenerate 
the grammar is, because the subject has 
missed the point of the grammar, anyhow 

We would like to venture another guess 
to explain the absence of a clear difference 
in the Grammar groups as measured by the 
kind of strategies subjects used. Presum- 
ably, each time one or more parts is added 
to a grammatical system, the information 
function is jointly and _ simultaneously 
affected by the increase in complexity and 
redundancy of the system. In other words, 
confusion and facilitation go hand in hand 
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in grammatical redundancy and these effects 
tend to cancel each other. Thus more for- 
mulas neither make the task easier nor do 
they make it more difficult. 

Although the quantitative factors of 
strings, states, and rules of transition seem 
to make little difference in performance in 
the present study, recursive loops create 
some problems for a subject. Two common 
difficulties pertaining to loops characterize 
the responses of most of our subjects. 
These difficulties we shall label as: the 
zero concept difficulty and the double-loop 
concept difficulty. 

The zero concept: It was not infrequent 
to find a subject puzzled about the basic 
string of a formula. For example, if one 
of the formulas in a grammar was a (c) 
b b a, its basic string, a (c)° b b a, was 
often held as a separate card for a con- 
siderable period of time and was even 
placed in a separate category by some sub- 
jects. This observation seems more typical 
of the double-loop formula to be discussed 
below. When a subject had decoded a 
grammatical rule as the experimenter had 
structured it, he could easily see the arith- 
metical progression in the development of 
a loop move from 1 to #, even though quite 
deliberately some exponents in the progres- 
sion had been left out. But it seemed diff- 
cult for a subject to see that the exponent 
of the loop could also be zero. Viewed 
historically, the zero concept was not easily 
come by either. Perhaps it is difficult for 
one to think of starting a series with 
nothing. Normal counting frequently be- 
gins with 1 and moves up to n. It is, there- 
fore, quite possible that this way of count- 
ing interferes with the necessity to begin 
the series with the zero notation in our 
formulas. 

The double-loop concept: The zero 
concept seemed even more difficult for sub- 
jects to handle in the double-loop for- 
mulas. This was so because the double- 
loop in itself was a hard concept for the 
subject. We have described the double- 
loop as a formula consisting of two vari- 
ables, instead of just one, within an in- 
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variant sequence of symbols. It has been 
schematized thus: a (c) bc (b) a. 

An R score of 8 or 9 signifies that sub- 
jects broke up the double-loop subset into 
three further sub-subsets as _ follows: 
a (c)* bcea,abc (b)*” a, and a (c)*” 
b c (b)?” a. The basic string was either 
held as a separate category, or it was placed 
with one of the first two sub-subsets. In 
watching subjects at work, it was fairly 
obvious to the experimenter that subjects 
were having considerable difficulty with the 
double-loop subset. They insisted upon ap- 
plying the simple rule of one loop varying 
from 0 or 1 to m to this complex formula, 
and were confused with the similarity within 
the three sub-subsets. This can be very 
clearly seen in the thinking-aloud responses 
of our subject in the Illustrative Study. One 
could explain this difficulty on the basis of 
two characteristics of pattern response. The 
first, that the subject formulates his method 
early in the game and more or less stays 
with it; and the second, that the subject 
seeks to apply one pattern consistently 
throughout to all parts of the system. 

Reiss pointed out that a subject con- 
structs his “model” fairly early, and then 
begins to stabilize around it as he tests it 
with successive negative or positive in- 
stances. From the thinking-aloud responses 
of our subjects and on the basis of data 
from other studies on the evolution of 
strategies not reported here, Reiss seems to 
be right. Those subjects who initiated a 
counting strategy in the beginning showed 
a strong counting bias, and those who used 
the initial strategy stuck pretty closely to 
the left-to-right rule of comparing se- 
quences. Exceptions did occur in the Ran- 
dom group, or when the method of pres- 
entation of instances was varied. In the Ran- 
dom set, one could find the subject trying 
all sorts of strategies, and finally settling 
on a perceptual one, when nothing else 
seemed to work. But in the sets where the 
subject had once perceived the structural 
order in a subset fairly early, or even later, 
he did not give it up. This does not mean, 
of course, that there is no further refine- 
ment of the strategy with an increase in the 


amount of evidence. It simply means that 
more instances only help to sharpen or 
modify the “model” already constructed. 

This kind of fixation in strategy ties in 
with the point we are making about con- 
sistency in sorting. By and large, most 
subjects work with one method of classi- 
fication for all the subsets they make. They 
do not seem to want to shift gears in their 
strategy from subset to subset. This 
characteristic way of responding is inde- 
pendent of the input, and it seems to give 
us some information about the human chan- 
nel itself. It is, therefore, important to 
note that not only does a subject organize 
the input sequences into categories, but 
that he does this on the basis of a con- 
sistent principle. 

Time 

Since time did not seem to be a simple 
function of a single variable we have post- 
poned its discussion; a few remarks are in 
order now. We note first, that the amount 
of time taken for the task is not a unique 
function of the size of the set although 
size may have something to do with it. 
Next we note that time seems to be posi- 
tively correlated with strategy. And finally 
we see that time is affected by the size of 
the vocabulary, though not without excep 
tion. 

Time and Size of Set. 
the relation between time and size of set. 
One is impressed by a lack of orderly vari- 
ation in the amount of time corresponding 
to an increase in the size of the set. The 
group total time spent on a set of 13 instances 
was 49.56 seconds for the larger and 80.36 
seconds for the smaller vocabulary. When 
the size of the set increases to 16 (larger 
vocabulary), time reduces to 39.16 seconds, 
and when the size increases to 18 instances 
(smaller vocabulary), time goes up to 67.16 
seconds. Subjects take more time with a set 
of 13 instances developed with a vocabu- 
lary of three words, than they do with a 
set of 23 instances irrespective of vocabu- 
lary size. The small number of instances 
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more manageable, but rather, it seems to 
make it more difficuit, possibly by obscur 
ing the pattern. A small set of instances 
gives him less evidence upon which to base 
his induction of the appropriate categories. 
The variables which interact with the size 
of the set apparently are far more critical 


It is difficult to 
function on the 


limits) of the set itself. 
determine the true time 
basis of adding just one more ins 
the set. It seems one must know what that 
one more instance is, and also, what some 
of the other characteristics of the set are 
Time and Strategy. The 
measures are correlated with the more su 
cessful strategies. Even at the risk of ex 
plaining the obvious, we shall state our 
point: complex strategies take more think 
ing and thinking takes time. We define 
thinking in this situation as a search for 
relations among parts of the information 
received, in order to recode this informa 
tion into suitable units for use and for 
storage. The human link in any communica- 
tion system is responsive not only to the 
immediate reception and use of informa 
tion—its function is a far more comple? 
one of storing this information for later 
use. As a finite system, the organism is 
confronted with an infinite amount of i 
formation from its environment ; the quality 
of its efficiency is on trial on the basis of 
the recoding scheme the system envisages 
How much and how well the old informa 
tion can be reconstructed from the recoded 


of efficiency 


tance to 


longer tim 


units will determine the degre 
of the 
strategy makes the 


thinking involved. A_ structural 


reconstruction of 
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original message far more probable 1 
simple counting strategy, for instance. It 
is, therefore, not difficult to see why i 
would also take more time. Even Donders 
(cited in Boring, 1950, pp. 147-149) was 
suggesting a hundred years ago that th 
longer reaction times were an indication of 
complex processes. We might speculate at 
this point and add that if we could force 
each subject to work with the 
given in the set for a longer period of time 


information 


(as we forced the subject described in the 


Illustrative Study) by presenting the in- 
formation to him one instance at a time, 
we would expect more subjects, possibly all, 
to come closer to the original structure of 
the grammar, despite the complexity of the 
stimulus conditions. For example, one 
subject in the (1,3,13) group spent next to 
the longest time of any and came up with a 
better solution than any of the other sub- 
jects in that group. Give a man sufficient 
time and he will find some way to reduce 
the heterogeneity of his environment to 
lawful change. 


CONCLUDING OBSERVATIONS 


[t is clear from the data that pattern con- 
ception is a characteristic way in which 
people respond to a set of linguistic se- 
quences, irrespective of the structure of 
the sequences. In an “open system’’ such 
as ours (unstructured in instruction and 
in meeting any given criteria) the subject 
must define the problem himself, construct 
some sort of working hypothesis to deal 
with the information he has extracted from 
the input, and look for evidence to see if 
his hypothesis holds. He then discovers 
gaps in the information he has acquired, 
he searches for new €vidence to fill the 
gaps, and finally he accepts his plan or 
rejects it if the gaps remain unfilled. 
Usually, though a subject may show in- 
tolerance of uncertainty, he may occasion- 
ally decide to live with his problem. The 
strategy or plan he settles on is partially a 
function of some experimental variables. 

It was interesting to observe that remi- 
niscent of the Kantian categorical impera- 
tive, our subjects seemed to be bound by a 
categorizing imperative. And what is more, 
their categorizing seems to fall within a 
limited range. Whether there were 13 in- 
stances in a set, or 50, and whether there 
were 15 subjects tested or 150, it did not 
seem to matter; the mean number of cate- 
gories ranged between five and seven (sce 
Figure 4). By some strange coincidence, 
the integer that has been “persecuting” 
Miller (1956) for a decade, turns up once 
again in our data. What does one make of 
tg 
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ligures 4, 5, and 6 all relate to the num-  ¢,._,_,_, 
ber of categories subjects make. The func- 2 SRE OF SET NN RANG 
tion in Figure 4 represents data for the © ae ee Ee ee 
mean number of categories subjects made ¢ oy a0! Se 
for the different sized sets. Data show 3 
that the number of categories increases > 
somewhat with an increase in the totalnum- & 
ber of instances in the set, but that this in- 4 
crease is proportionately less. 3 
The average frequency of the different % Seer rst 
INSTANCE CATEGORIES 


types of categories subjects make is pre- 
sented in Figure 5. By the type of category 
is meant the number of instances subjects 
put into one category; for example, there 
may be a one-instance category, a category 
with two instances in it, with three, and so 
on. The number of instances in a category 
and the total number of categories are in- 
terdependent factors, of course. Either a 
subject prefers to make a certain number 
of categories, or he prefers to put a certain 
number of instances together in one cate- 
gory. The more frequent categories are 
those with two, three, or four instances 
per category. 

Figure 6 treats the same data in a slightly 
different way. It presents individual cumu- 
lative curves for different language condi- 
tions to show the probability that a subject 
will make at least x categories. Note that 
for all conditions 50% of the subjects aver- 
age between five and six categories. 
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NUMBER OF INSTANCES 
Fic. 4. Mean number of categories per size of 
set. (N denotes the number of subjects in each 
group. The single filled circle represents the 


Random group.) 


Fic. 5. Mean number of instance categories 
per size of set. (Instance category refers to the 
number of instances placed in a category. The 
y axis represents what percentage an instance 
category is of the total number of categories made 
in each language. For example, in the 23-instance 
languages, 50 subjects made a total of 338 cate- 
gories, of which 54 categories were one-instance 
categories. ) 


We have stated above that although the 
number of instances in a set has something 
to do with the number of categories sub- 
jects make, it is not a determining factor. 
The relation of the number of basic strings 
in a set to the number of categories seems 
even less important, except when a subject 
sees the grammatical pattern accurately 
Quite frequently subjects wished to know 
how many categories they were expected to 
make. When they were informed that the 
choice of number rested with them, they 
seemed to guide their sorting by a desire 
not to have “too many” categories. This 
was especially true of subjects employing 
some type of formula strategies. “It’s a 
shame to have a one-card category” was 
not unusual for a subject to say when an 
instance remained unclassified. If, how- 
ever, the choice before the subject was be- 
ber of categories, he preferred consistency 
to number. 

Seven, then, is still a magic number. The 
“number seven, plus or minus two” de- 
notes according to Miller (1956), ‘“‘the 
span of absolute judgment” and the “span 
of memory” (p. 90). We interpret the spans 
to mean that there is a finite number of 
isolated events, or of groups of events, 
within the neighborhood of seven, which 
the human link in a communication system 
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Conceptual recoding is here discussed in 
terms of the discovery and formulation of 
rules to describe cognitive structures. Lan- 
guage provides one of the best illustrations ; 
verbal recoding of experience seems to be 
the essence of human thinking. It is to lan- 
guage that one can turn, therefore, to find 
the materials needed for experimental study 
of the processes of conceptual recoding. 
Natural languages, however, are tremen- 
dously complicated; some of the same 
processes can be studied more simply with 
artificial language systems. In the present 
study finite state systems were used to 
simulate natural languages. They are 
simpler, their formal structure can be 
stated precisely, yet they provide the kind of 
sequential patterning that is characteristic 
of all verbal communication. Subjects were 
given a set of linguistic strings, random or 
constrained, generated by some finite state 
systems; they were asked to group the 
strings in some appropriate way. In order 
to perform such a task all subjects employ 
some intelligible, recognizable plan for sort- 
ing the strings. The number of different 
strategies that are observed to occur is 
small and subjects are easily classified ac- 
cording to the strategy they adopt. Some 
strategies are more complicated than 
others ; they range from simple, ready-made 
rules for counting and alphabetizing to 
ad hoc inventions of rules that a subject 
considers implicit in the information he has 
been given. The former are more percep- 
tual, the latter more conceptual in spirit, 
but both are ways of describing the struc- 
tural rules of the system. 

Pattern conception depends upon the 
kind and amount of the evidence supplied 
by the experimenter. In the present studies, 
the length of the messages, the number of 
instances, the number of alternatives, and 
structural similarity are some of the ex- 
perimental variables by which messages 
were varied. The results show that the 
variety of symbols (number of different 
alternatives) with which the messages are 
constructed is far more critical in con- 
trolling pattern conception than is the 
length or the structure of the messages. 


Two sets of messages, identical in length, in 
rules of formation, and in the instruction 
given subjects, but differing in the number 
of symbols, elicit qualitatively different re- 
sponses. A large variety of symbols tends 
to elicit the invented strategies whereas the 
small variety generally leads to ready- 
made rules. This result indicates that the 
subject is able to recode the given informa- 
tion more effectively when there is a variety 
in the symbols from which it is composed. 

A parallel advantage is observed by in- 
creasing the amount of evidence provided 
to a subject for a given rule. The number 
of instances per rule was varied from two 
to three to four instances, while other fac- 
tors of vocabulary, grammar, and instruc- 
tions were held constant. The strategies 
that subjects used under each of these con- 
ditions were evaluated in terms of two sta- 
tistical techniques developed specifically for 
the purpose of strategy analysis. As the 
amount of evidence increased, the strate- 
gies improved. A larger quantity of evi- 
dence facilitates recoding of the sequences 
in terms of variant and invariant symbols. 

Earlier experiments on perception have 
indicated that redundancy may be of several 
kinds, and that the influence of redundancy 
upon responses varies with its quality. 
Although we have not applied an informa- 
tion measure of redundancy to our ma- 
terials (the finite state systems that we 
used are too simple to require such statisti- 
cal treatment) we have worked with a 
related conception of structural similarity 
(rules of formation) in linguistic se- 
quences. Languages varying in number and 
complexity of rules of formation, but 
similar in length and size of vocabulary, 
do not yield markedly different results. The 
fewer rules embedded in a language do not 
necessarily make that language simpler. 
Data indicate that structural redundancy is 
less critical in pattern conception than is 
the variety in symbols or the amount of 
evidence. More experiments with better 
controls on redundancy are needed to de- 
termine the effect of this variable on pat- 
tern conception. 
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Two specific difficulties were found to 
characterize the responses of many subjects 
in almost all of the experimental groups. 
These difficulties were the concept of zero 
and the concept of double variation in a 
linguistic sequence of variant and invariant 
symbols. Subjects could delete or add the 
variable from 1 to m fairly easily, but they 
struggled with the string when the exponent 
of the variable was equal to zero. In the 
case of more than one variable in a string 
subjects tended to group its instances into 
as many subsets as there were variables in 
the string. This multiple-loop difficulty is 
not unrelated to the zero difficulty, 


how 
ever. 

Pattern conception, apparently, is not a 
f any one experimental 


simple function 
variable. This point was most clearly evi 


which did not 


dent from the time measures, 


SHIPSTONE 


correlate consistently with any one experi- 
mental condition. It was reasonably clear, 
however, that the longer time measures cor- 
related positively with the better strategies. 
Presumably, the central processes of re- 
coding and thinking take time, which causes 
the delay in response. There was little evi- 
dence for an increase in time with the in- 
creasing length of the set. 

Finally, all studies led us to conclude that, 
irrespective of the type of finite state sys- 
tem used—its structure, vocabulary, length, 
or anything else—the average number of 
categories that subjects make is five or six. 
Although we were not looking for this 
point, this result is in keeping with the 
findings from some studies on “channel 
capacity,” and it indicates that thinking like 
memory is centrally affected by the channel 
capacity of the organism. 
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APPENDIX A 


AGREEMENT-DISAGREEMENT RATIO 


The Agreement-Disagreement ratio is a sta- 
tistic devised to measure the amount of correla- 
tion between sortings made by any two persons. 
The technique of calculating the amount of 
agreement between different sortings is as 
follows: 

Consider the sortings made by any two sub- 
jects X and Y: 

X—(1 2 3 45) (6 12 16) (9 14) 
(7 8 10 11 13 15) 


Y—(1 2 4 5) (3) (6 10 12 15) 
(7 89 11 13 14) (16) 


Step 1. Construct sets from the intersection 
of these two classifications : 


Intersection (1245) (612) (78 11 13) 

(9 14) (10.15) (3). (46) 
The intersection is constructed in such a way 
that every set in the data for X and Y can be 
reconstructed by combining sets from the inter- 
section, and yet the sets in the intersection are 
as large as possible. Otherwise said, every 
distinction among the strings that either X or 

Y indicated is preserved in the intersection. 


Step 2. Calculate the number of agreements 
for each set. For example, if a set contains 
four strings there are 3 + 2 + 1 = 6 pairs of 
strings that agree with one another in the 
sense that they are placed in the same class 
and considered to be equivalent. Below are 


tabulated the number of pairs of strings that 
agree as a function of the size of set. 


size number agreements 
1 0 
2 1 
3 a 
4 6 
< 10 
# x(*—1)/2 


Thus, for example, a set of 11 strings would 
yield 55 pairs that were in agreement. If size 
is + then the number of agreements is given by 
the equation A = x(x—1)/2. So the second 
step is to add up the number of agreements in 
this sense for X, Y, and the intersection: 


subject X subject Y intersection 
5 40 + 6 + 6 
3 3 1 0 1 0 
2 1 4 6 2 1 
6 15 6 15 4 6 
29 : ss = 4 
27 ae 
1 0 
15 


Step 3. Calculate the A-D ratio from these 
three numbers, 29, 27, 15, by substituting them 
into the following equation: 

x + y — 2 Intersection 
60 
a1 2+%=2(15)_ 1 _ % 
60 60 
= 1 — 443 = .567 








r=1-—- 
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APPENDIX B 
RATING SCALE FOR SUBJECTS’ STRATEGIES 


The second column gives the average scores for 
the ratings assigned to 20 strategies by five judges. 
The first column rounds off the average scores for 
purposes of establishing a 10-point rating scale. 








Final Rating Average 


Score Score* Kind of Strategy 
1 1.0 General Size 
1.5 Length 
2 Bi Initial, or Middle;, or 
Final, 
2.5 Alphabet 
3 ro | One striking word with 
Initial or Final one 
3.1 Initial. or n)» Middle orn)» 
Finale or n) 
3.1 Initial,-Final, 
3.4 Length plus Alphabet 
3.4 Initial; plus Alphabet or 
Initial plus Length 
3.6 Rhythm 
4 3.8 Mixed methods for entire 
set 
3.8 Mixed methods by groups 
4.3 Alternating word or com- 
plex counting 
5 5.3 Symmetry and Rhythm 
5.6 Initial plus complex sub- 
stitution or ordering 
6 6.2 Initiale or 3)-Finaly or 3) 
7 7.4 Maximum Identity 
8 8.2 Initial-Middle-Final 
9 9.0 Less than basic string 


(breaking up the for- 
mula by loops) 

10 10.0 Formula (same as experi- 
menter’s grammar) 








® The steps between scores are not necessarily equal. 

















